SBC logo Authors: Amine Heddad, Andrea Krings, Markus Brameier and Bob MacCallum, Stockholm Bioinformatics Center, Stockholm University, Sweden.

NucPred

Fetching P20929 from www.uniprot.org...

The NucPred score for your sequence is 0.87 (see score help below)

   1  MADDEDYEEVVEYYTEEVVYEEVPGETITKIYETTTTRTSDYEQSETSKP    50
51 ALAQPALAQPASAKPVERRKVIRKKVDPSKFMTPYIAHSQKMQDLFSPNK 100
101 YKEKFEKTKGQPYASTTDTPELRRIKKVQDQLSEVKYRMDGDVAKTICHV 150
151 DEKAKDIEHAKKVSQQVSKVLYKQNWEDTKDKYLLPPDAPELVQAVKNTA 200
201 MFSKKLYTEDWEADKSLFYPYNDSPELRRVAQAQKALSDVAYKKGLAEQQ 250
251 AQFTPLADPPDIEFAKKVTNQVSKQKYKEDYENKIKGKWSETPCFEVANA 300
301 RMNADNISTRKYQEDFENMKDQIYFMQTETPEYKMNKKAGVAASKVKYKE 350
351 DYEKNKGKADYNVLPASENPQLRQLKAAGDALSDKLYKENYEKTKAKSIN 400
401 YCETPKFKLDTVLQNFSSDKKYKDSYLKDILGHYVGSFEDPYHSHCMKVT 450
451 AQNSDKNYKAEYEEDRGKGFFPQTITQEYEAIKKLDQCKDHTYKVHPDKT 500
501 KFTQVTDSPVLLQAQVNSKQLSDLNYKAKHESEKFKCHIPPDTPAFIQHK 550
551 VNAYNLSDNLYKQDWEKSKAKKFDIKVDAIPLLAAKANTKNTSDVMYKKD 600
601 YEKNKGKMIGVLSINDDPKMLHSLKVAKNQSDRLYKENYEKTKAKSMNYC 650
651 ETPKYQLDTQLKNFSEARYKDLYVKDVLGHYVGSMEDPYHTHCMKVAAQN 700
701 SDKSYKAEYEEDKGKCYFPQTITQEYEAIKKLDQCKDHTYKVHPDKTKFT 750
751 AVTDSPVLLQAQLNTKQLSDLNYKAKHEGEKFKCHIPADAPQFIQHRVNA 800
801 YNLSDNVYKQDWEKSKAKKFDIKVDAIPLLAAKANTKNTSDVMYKKDYEK 850
851 SKGKMIGALSINDDPKMLHSLKTAKNQSDREYRKDYEKSKTIYTAPLDML 900
901 QVTQAKKSQAIASDVDYKHILHSYSYPPDSINVDLAKKAYALQSDVEYKA 950
951 DYNSWMKGCGWVPFGSLEMEKAKRASDILNEKKYRQHPDTLKFTSIEDAP 1000
1001 ITVQSKINQAQRSDIAYKAKGEEIIHKYNLPPDLPQFIQAKVNAYNISEN 1050
1051 MYKADLKDLSKKGYDLRTDAIPIRAAKAARQAASDVQYKKDYEKAKGKMV 1100
1101 GFQSLQDDPKLVHYMNVAKIQSDREYKKDYEKTKSKYNTPHDMFNVVAAK 1150
1151 KAQDVVSNVNYKHSLHHYTYLPDAMDLELSKNMMQIQSDNVYKEDYNNWM 1200
1201 KGIGWIPIGSLDVEKVKKAGDALNEKKYRQHPDTLKFTSIVDSPVMVQAK 1250
1251 QNTKQVSDILYKAKGEDVKHKYTMSPDLPQFLQAKCNAYNISDVCYKRDW 1300
1301 YDLIAKGNNVLGDAIPITAAKASRNIASDYKYKEAYEKSKGKHVGFRSLQ 1350
1351 DDPKLVHYMNVAKLQSDREYKKNYENTKTSYHTPGDMVSITAAKMAQDVA 1400
1401 TNVNYKQPLHHYTYLPDAMSLEHTRNVNQIQSDNVYKDEYNSFLKGIGWI 1450
1451 PIGSLEVEKVKKAGDALNERKYRQHPDTVKFTSVPDSMGMVLAQHNTKQL 1500
1501 SDLNYKVEGEKLKHKYTIDPELPQFIQAKVNALNMSDAHYKADWKKTIAK 1550
1551 GYDLRPDAIPIVAAKSSRNIASDCKYKEAYEKAKGKQVGFLSLQDDPKLV 1600
1601 HYMNVAKIQSDREYKKGYEASKTKYHTPLDMVSVTAAKKSQEVATNANYR 1650
1651 QSYHHYTLLPDALNVEHSRNAMQIQSDNLYKSDFTNWMKGIGWVPIESLE 1700
1701 VEKAKKAGEILSEKKYRQHPEKLKFTYAMDTMEQALNKSNKLNMDKRLYT 1750
1751 EKWNKDKTTIHVMPDTPDILLSRVNQITMSDKLYKAGWEEEKKKGYDLRP 1800
1801 DAIAIKAARASRDIASDYKYKKAYEQAKGKHIGFRSLEDDPKLVHFMQVA 1850
1851 KMQSDREYKKGYEKSKTSFHTPVDMLSVVAAKKSQEVATNANYRNVIHTY 1900
1901 NMLPDAMSFELAKNMMQIQSDNQYKADYADFMKGIGWLPLGSLEAEKNKK 1950
1951 AMEIISEKKYRQHPDTLKYSTLMDSMNMVLAQNNAKIMNEHLYKQAWEAD 2000
2001 KTKVHIMPDIPQIILAKANAINMSDKLYKLSLEESKKKGYDLRPDAIPIK 2050
2051 AAKASRDIASDYKYKYNYEKGKGKMVGFRSLEDDPKLVHSMQVAKMQSDR 2100
2101 EYKKNYENTKTSYHTPADMLSVTAAKDAQANITNTNYKHLIHKYILLPDA 2150
2151 MNIELTRNMNRIQSDNEYKQDYNEWYKGLGWSPAGSLEVEKAKKATEYAS 2200
2201 DQKYRQHPSNFQFKKLTDSMDMVLAKQNAHTMNKHLYTIDWNKDKTKIHV 2250
2251 MPDTPDILQAKQNQTLYSQKLYKLGWEEALKKGYDLPVDAISVQLAKASR 2300
2301 DIASDYKYKQGYRKQLGHHVGFRSLQDDPKLVLSMNVAKMQSEREYKKDF 2350
2351 EKWKTKFSSPVDMLGVVLAKKCQELVSDVDYKNYLHQWTCLPDQNDVVQA 2400
2401 KKVYELQSENLYKSDLEWLRGIGWSPLGSLEAEKNKRASEIISEKKYRQP 2450
2451 PDRNKFTSIPDAMDIVLAKTNAKNRSDRLYREAWDKDKTQIHIMPDTPDI 2500
2501 VLAKANLINTSDKLYRMGYEELKRKGYDLPVDAIPIKAAKASREIASEYK 2550
2551 YKEGFRKQLGHHIGARNIEDDPKMMWSMHVAKIQSDREYKKDFEKWKTKF 2600
2601 SSPVDMLGVVLAKKCQTLVSDVDYKNYLHQWTCLPDQSDVIHARQAYDLQ 2650
2651 SDNLYKSDLQWLKGIGWMTSGSLEDEKNKRATQILSDHVYRQHPDQFKFS 2700
2701 SLMDSIPMVLAKNNAITMNHRLYTEAWDKDKTTVHIMPDTPEVLLAKQNK 2750
2751 VNYSEKLYKLGLEEAKRKGYDMRVDAIPIKAAKASRDIASEFKYKEGYRK 2800
2801 QLGHHIGARAIRDDPKMMWSMHVAKIQSDREYKKDFEKWKTKFSSPVDML 2850
2851 GVVLAKKCQTLVSDVDYKNYLHQWTCLPDQSDVIHARQAYDLQSDNMYKS 2900
2901 DLQWMRGIGWVSIGSLDVEKCKRATEILSDKIYRQPPDRFKFTSVTDSLE 2950
2951 QVLAKNNAITMNKRLYTEAWDKDKTQIHIMPDTPEIMLARMNKINYSESL 3000
3001 YKLANEEAKKKGYDLRSDAIPIVAAKASRDIISDYKYKDGYCKQLGHHIG 3050
3051 ARNIEDDPKMMWSMHVAKIQSDREYKKDFEKWKTKFSSPVDMLGVVLAKK 3100
3101 CQTLVSDVDYKNYLHEWTCLPDQSDVIHARQAYDLQSDNIYKSDLQWLRG 3150
3151 IGWVPIGSMDVVKCKRATEILSDNIYRQPPDKLKFTSVTDSLEQVLAKNN 3200
3201 ALNMNKRLYTEAWDKDKTQIHIMPDTPEIMLARQNKINYSETLYKLANEE 3250
3251 AKKKGYDLRSDAIPIVAAKASRDVISDYKYKDGYRKQLGHHIGARNIEDD 3300
3301 PKMMWSMHVAKIQSDREYKKDFEKWKTKFSSPVDMLGVVLAKKCQTLVSD 3350
3351 VDYKNYLHEWTCLPDQNDVIHARQAYDLQSDNIYKSDLQWLRGIGWVPIG 3400
3401 SMDVVKCKRAAEILSDNIYRQPPDKLKFTSVTDSLEQVLAKNNALNMNKR 3450
3451 LYTEAWDKDKTQVHIMPDTPEIMLARQNKINYSESLYRQAMEEAKKEGYD 3500
3501 LRSDAIPIVAAKASRDIASDYKYKEAYRKQLGHHIGARAVHDDPKIMWSL 3550
3551 HIAKVQSDREYKKDFEKYKTRYSSPVDMLGIVLAKKCQTLVSDVDYKHPL 3600
3601 HEWICLPDQNDIIHARKAYDLQSDNLYKSDLEWMKGIGWVPIDSLEVVRA 3650
3651 KRAGELLSDTIYRQRPETLKFTSITDTPEQVLAKNNALNMNKRLYTEAWD 3700
3701 NDKKTIHVMPDTPEIMLAKLNRINYSDKLYKLALEESKKEGYDLRLDAIP 3750
3751 IQAAKASRDIASDYKYKEGYRKQLGHHIGARNIKDDPKMMWSIHVAKIQS 3800
3801 DREYKKEFEKWKTKFSSPVDMLGVVLAKKCQILVSDIDYKHPLHEWTCLP 3850
3851 DQNDVIQARKAYDLQSDAIYKSDLEWLRGIGWVPIGSVEVEKVKRAGEIL 3900
3901 SDRKYRQPADQLKFTCITDTPEIVLAKNNALTMSKHLYTEAWDADKTSIH 3950
3951 VMPDTPDILLAKSNSANISQKLYTKGWDESKMKDYDLRADAISIKSAKAS 4000
4001 RDIASDYKYKEAYEKQKGHHIGAQSIEDDPKIMCAIHAGKIQSEREYKKE 4050
4051 FQKWKTKFSSPVDMLSILLAKKCQTLVTDIDYRNYLHEWTCMPDQNDIIQ 4100
4101 AKKAYDLQSDSVYKADLEWLRGIGWMPEGSVEMNRVKVAQDLVNERLYRT 4150
4151 RPEALSFTSIVDTPEVVLAKANSLQISEKLYQEAWNKDKSNITIPSDTPE 4200
4201 MLQAHINALQISNKLYQKDWNDAKQKGYDIRADAIEIKHAKASREIASEY 4250
4251 KYKEGYRKQLGHHMGFRTLQDDPKSVWAIHAAKIQSDREYKKAYEKSKGI 4300
4301 HNTPLDMMSIVQAKKCQVLVSDIDYRNYLHQWTCLPDQNDVIQAKKAYDL 4350
4351 QSDNLYKSDLEWLKGIGWLPEGSVEVMRVKNAQNLLNERLYRIKPEALKF 4400
4401 TSIVDTPEVIQAKINAVQISEPLYRDAWEKEKANVNVPADTPLMLQSKIN 4450
4451 ALQISNKRYQQAWEDVKMTGYDLRADAIGIQHAKASRDIASDYLYKTAYE 4500
4501 KQKGHYIGCRSAKEDPKLVWAANVLKMQNDRLYKKAYNDHKAKISIPVDM 4550
4551 VSISAAKEGQALASDVDYRHYLHHWSCFPDQNDVIQARKAYDLQSDSVYK 4600
4601 ADLEWLRGIGWMPEGSVEMNRVKVAQDLVNERLYRTRPEALSFTSIVDTP 4650
4651 EVVLAKANSLQISEKLYQEAWNKDKSNITIPSDTPEMLQAHINALQISNK 4700
4701 LYQKDWNDTKQKGYDIRADAIEIKHAKASREIASEYKYKEGYRKQLGHHM 4750
4751 GFRTLQDDPKSVWAIHAAKIQSDREYKKAYEKSKGIHNTPLDMMSIVQAK 4800
4801 KCQVLVSDIDYRNYLHQWTCLPDQNDVIQAKKAYDLQSDNLYKSDLEWLK 4850
4851 GIGWLPEGSVEVMRVKNAQNLLNERLYRIKPEALKFTSIVDTPEVIQAKI 4900
4901 NAVQISEPLYRNAWEKEKANVNVPADTPLMLQSKINALQISNKRYQQAWE 4950
4951 DVKMTGYDLRADAIGIQHAKASRDIASDYLYKTAYEKQKGHYIGCRSAKE 5000
5001 DPKLVWAANVLKMQNDRLYKKAYNDHKAKISIPVDMVSISAAKEGQALAS 5050
5051 DVDYRHYLHHWSCFPDQNDVIQARKAYDLQSDSVYKADLEWLRGIGWMPE 5100
5101 GSVEMNRVKVAQDLVNERLYRTRPEALSFTSIVDTPEVVLAKANSLQISE 5150
5151 KLYQEAWNKDKSNITIPSDTPEMLQAHINALQISNKLYQKDWNDTKQKGY 5200
5201 DIRADAIEIKHAKASREIASEYKYKEGYRKQLGHHMGFRTLQDDPKSVWA 5250
5251 IHAAKIQSDREYKKAYEKSKGIHNTPLDMMSIVQAKKCQVLVSDIDYRNY 5300
5301 LHQWTCLPDQNDVIQAKKAYDLQSDNLYKSDLEWLKGIGWLPEGSVEVMR 5350
5351 VKNAQNLLNERLYRIKPEALKFTSIVDTPEVIQAKINAVQISEPLYRDAW 5400
5401 EKEKANVNVPADTPLMLQSKINALQISNKRYQQAWEDVKMTGYDLRADAI 5450
5451 GIQHAKASRDIASDYLYKTAYEKQKGHYIGCRSAKEDPKLVWAANVLKMQ 5500
5501 NDRLYKKAYNDHKAKISIPVDMVSISAAKEGQALASDVDYRHYLHRWSCF 5550
5551 PDQNDVIQARKAYDLQSDALYKADLEWLRGIGWMPQGSPEVLRVKNAQNI 5600
5601 FCDSVYRTPVVNLKYTSIVDTPEVVLAKSNAENISIPKYREVWDKDKTSI 5650
5651 HIMPDTPEINLARANALNVSNKLYREGWDEMKAGCDVRLDAIPIQAAKAS 5700
5701 REIASDYKYKLDHEKQKGHYVGTLTARDDNKIRWALIADKLQNEREYRLD 5750
5751 WAKWKAKIQSPVDMLSILHSKNSQALVSDMDYRNYLHQWTCMPDQNDVIQ 5800
5801 AKKAYELQSDNVYKADLEWLRGIGWMPNDSVSVNHAKHAADIFSEKKYRT 5850
5851 KIETLNFTPVDDRVDYVTAKQSGEILDDIKYRKDWNATKSKYTLTETPLL 5900
5901 HTAQEAARILDQYLYKEGWERQKATGYILPPDAVPFVHAHHCNDVQSELK 5950
5951 YKAEHVKQKGHYVGVPTMRDDPKLVWFEHAGQIQNERLYKEDYHKTKAKI 6000
6001 NIPADMVSVLAAKQGQTLVSDIDYRNYLHQWMCHPDQNDVIQARKAYDLQ 6050
6051 SDNVYRADLEWLRGIGWIPLDSVDHVRVTKNQEMMSQIKYKKNALENYPN 6100
6101 FRSVVDPPEIVLAKINSVNQSDVKYKETFNKAKGKYTFSPDTPHISHSKD 6150
6151 MGKLYSTILYKGAWEGTKAYGYTLDERYIPIVGAKHADLVNSELKYKETY 6200
6201 EKQKGHYLAGKVIGEFPGVVHCLDFQKMRSALNYRKHYEDTKANVHIPND 6250
6251 MMNHVLAKRCQYILSDLEYRHYFHQWTSLLEEPNVIRVRNAQEILSDNVY 6300
6301 KDDLNWLKGIGCYVWDTPQILHAKKSYDLQSQLQYTAAGKENLQNYNLVT 6350
6351 DTPLYVTAVQSGINASEVKYKENYHQIKDKYTTVLETVDYDRTRNLKNLY 6400
6401 SSNLYKEAWDRVKATSYILPSSTLSLTHAKNQKHLASHIKYREEYEKFKA 6450
6451 LYTLPRSVDDDPNTARCLRVGKLNIDRLYRSVYEKNKMKIHIVPDMVEMV 6500
6501 TAKDSQKKVSEIDYRLRLHEWICHPDLQVNDHVRKVTDQISDIVYKDDLN 6550
6551 WLKGIGCYVWDTPEILHAKHAYDLRDDIKYKAHMLKTRNDYKLVTDTPVY 6600
6601 VQAVKSGKQLSDAVYHYDYVHSVRGKVAPTTKTVDLDRALHAYKLQSSNL 6650
6651 YKTSLRTLPTGYRLPGDTPHFKHIKDTRYMSSYFKYKEAYEHTKAYGYTL 6700
6701 GPKDVPFVHVRRVNNVTSERLYRELYHKLKDKIHTTPDTPEIRQVKKTQE 6750
6751 AVSELIYKSDFFKMQGHMISLPYTPQVIHCRYVGDITSDIKYKEDLQVLK 6800
6801 GFGCFLYDTPDMVRSRHLRKLWSNYLYTDKARKMRDKYKVVLDTPEYRKV 6850
6851 QELKTHLSELVYRAAGKKQKSIFTSVPDTPDLLRAKRGQKLQSQYLYVEL 6900
6901 ATKERPHHHAGNQTTALKHAKDVKDMVSEKKYKIQYEKMKDKYTPVPDTP 6950
6951 ILIRAKRAYWNASDLRYKETFQKTKGKYHTVKDALDIVYHRKVTDDISKI 7000
7001 KYKENYMSQLGIWRSIPDRPEHFHHRAVTDTVSDVKYKEDLTWLKGIGCY 7050
7051 AYDTPDFTLAEKNKTLYSKYKYKEVFERTKSDFKYVADSPINRHFKYATQ 7100
7101 LMNERKYKSSAKMFLQHGCNEILRPDMLTALYNSHMWSQIKYRKNYEKSK 7150
7151 DKFTSIVDTPEHLRTTKVNKQISDILYKLEYNKAKPRGYTTIHDTPMLLH 7200
7201 VRKVKDEVSDLKYKEVYQRNKSNCTIEPDAVHIKAAKDAYKVNTNLDYKK 7250
7251 QYEANKAHWKWTPDRPDFLQAAKSSLQQSDFEYKLDREFLKGCKLSVTDD 7300
7301 KNTVLALRNTLIESDLKYKEKHVKERGTCHAVPDTPQILLAKTVSNLVSE 7350
7351 NKYKDHVKKHLAQGSYTTLPETRDTVHVKEVTKHVSDTNYKKKFVKEKGK 7400
7401 SNYSIMLEPPEVKHAMEVAKKQSDVAYRKDAKENLHYTTVADRPDIKKAT 7450
7451 QAAKQASEVEYRAKHRKEGSHGLSMLGRPDIEMAKKAAKLSSQVKYRENF 7500
7501 DKEKGKTPKYNPKDSQLYKVMKDANNLASEVKYKADLKKLHKPVTDMKES 7550
7551 LIMNHVLNTSQLASSYQYKKKYEKSKGHYHTIPDNLEQLHLKEATELQSI 7600
7601 VKYKEKYEKERGKPMLDFETPTYITAKESQQMQSGKEYRKDYEESIKGRN 7650
7651 LTGLEVTPALLHVKYATKIASEKEYRKDLEESIRGKGLTEMEDTPDMLRA 7700
7701 KNATQILNEKEYKRDLELEVKGRGLNAMANETPDFMRARNATDIASQIKY 7750
7751 KQSAEMEKANFTSVVDTPEIIHAQQVKNLSSQKKYKEDAEKSMSYYETVL 7800
7801 DTPEIQRVRENQKNFSLLQYQCDLKNSKGKITVVQDTPEILRVKENQKNF 7850
7851 SSVLYKEDVSPGTAIGKTPEMMRVKQTQDHISSVKYKEAIGQGTPIPDLP 7900
7901 EVKRVKETQKHISSVMYKENLGTGIPTTVTPEIERVKRNQENFSSVLYKE 7950
7951 NLGKGIPTPITPEMERVKRNQENFSSILYKENLSKGTPLPVTPEMERVKL 8000
8001 NQENFSSVLYKENVGKGIPIPITPEMERVKHNQENFSSVLYKENLGTGIP 8050
8051 IPITPEMQRVKHNQENLSSVLYKENMGKGTPLPVTPEMERVKHNQENISS 8100
8101 VLYKENMGKGTPLPVTPEMERVKHNQENISSVLYKENMGKGTPLAVTPEM 8150
8151 ERVKHNQENISSVLYKENVGKATATPVTPEMQRVKRNQENISSVLYKENL 8200
8201 GKATPTPFTPEMERVKRNQENFSSVLYKENMRKATPTPVTPEMERAKRNQ 8250
8251 ENISSVLYSDSFRKQIQGKAAYVLDTPEMRRVRETQRHISTVKYHEDFEK 8300
8301 HKGCFTPVVTDPITERVKKNMQDFSDINYRGIQRKVVEMEQKRNDQDQET 8350
8351 ITGLRVWRTNPGSVFDYDPAEDNIQSRSLHMINVQAQRRSREQSRSASAL 8400
8401 SISGGEEKSEHSEAPDHHLSTYSDGGVFAVSTAYKHAKTTELPQQRSSSV 8450
8451 ATQQTTVSSIPSHPSTAGKIFRAMYDYMAADADEVSFKDGDAIINVQAID 8500
8501 EGWMYGTVQRTGRTGMLPANYVEAI 8525

Positively and negatively influencing subsequences are coloured according to the following scale:

(non-nuclear) negative ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| positive (nuclear)

with NucPred



If you find NucPred useful, please cite this paper:
NucPred - Predicting Nuclear Localization of Proteins. Brameier M, Krings A, Maccallum RM. Bioinformatics, 2007. PubMed id: 17332022
The authors also look forward to your comments and suggestions.

What does the NucPred score mean?

You have to decide on a NucPred score threshold. Sequences which score greater than or equal to this threshold are predicted to spend some time in the nucleus. Higher thresholds yield fewer predicted nuclear proteins, but these predictions are more accurate (you can have higher confidence in them). The table below gives more details of the performance of NucPred estimated using the sequences it was trained on (by cross-validation). Another benchmark is available in the Bioinformatics 2007 paper.

NucPred score threshold Specificity Sensitivity
see above fraction of proteins predicted to be nuclear that actually are nuclear fraction of true nuclear proteins that are predicted (coverage)
0.10 0.45 0.88
0.20 0.52 0.83
0.30 0.57 0.77
0.40 0.63 0.69
0.50 0.70 0.62
0.60 0.71 0.53
0.70 0.81 0.44
0.80 0.84 0.32
0.90 0.88 0.21
1.00 1.00 0.02

Sequences which score >= 0.8 with NucPred and which are predicted by PredictNLS to contain an NLS have been shown to be 93% correct with a coverage of 16%. (PredictNLS by itself is 87% correct with 26% coverage on the same data.)

Go back to the NucPred Home Page.