SBC logo Authors: Amine Heddad, Andrea Krings, Markus Brameier and Bob MacCallum, Stockholm Bioinformatics Center, Stockholm University, Sweden.

NucPred

Fetching P02751 from www.uniprot.org...

The NucPred score for your sequence is 0.39 (see score help below)

   1  MLRGPGPGLLLLAVQCLGTAVPSTGASKSKRQAQQMVQPQSPVAVSQSKP    50
51 GCYDNGKHYQINQQWERTYLGNALVCTCYGGSRGFNCESKPEAEETCFDK 100
101 YTGNTYRVGDTYERPKDSMIWDCTCIGAGRGRISCTIANRCHEGGQSYKI 150
151 GDTWRRPHETGGYMLECVCLGNGKGEWTCKPIAEKCFDHAAGTSYVVGET 200
201 WEKPYQGWMMVDCTCLGEGSGRITCTSRNRCNDQDTRTSYRIGDTWSKKD 250
251 NRGNLLQCICTGNGRGEWKCERHTSVQTTSSGSGPFTDVRAAVYQPQPHP 300
301 QPPPYGHCVTDSGVVYSVGMQWLKTQGNKQMLCTCLGNGVSCQETAVTQT 350
351 YGGNSNGEPCVLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSF 400
401 CTDHTVLVQTRGGNSNGALCHFPFLYNNHNYTDCTSEGRRDNMKWCGTTQ 450
451 NYDADQKFGFCPMAAHEEICTTNEGVMYRIGDQWDKQHDMGHMMRCTCVG 500
501 NGRGEWTCIAYSQLRDQCIVDDITYNVNDTFHKRHEEGHMLNCTCFGQGR 550
551 GRWKCDPVDQCQDSETGTFYQIGDSWEKYVHGVRYQCYCYGRGIGEWHCQ 600
601 PLQTYPSSSGPVEVFITETPSQPNSHPIQWNAPQPSHISKYILRWRPKNS 650
651 VGRWKEATIPGHLNSYTIKGLKPGVVYEGQLISIQQYGHQEVTRFDFTTT 700
701 STSTPVTSNTVTGETTPFSPLVATSESVTEITASSFVVSWVSASDTVSGF 750
751 RVEYELSEEGDEPQYLDLPSTATSVNIPDLLPGRKYIVNVYQISEDGEQS 800
801 LILSTSQTTAPDAPPDTTVDQVDDTSIVVRWSRPQAPITGYRIVYSPSVE 850
851 GSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQESTPVVIQQETTG 900
901 TPRSDTVPSPRDLQFVEVTDVKVTIMWTPPESAVTGYRVDVIPVNLPGEH 950
951 GQRLPISRNTFAEVTGLSPGVTYYFKVFAVSHGRESKPLTAQQTTKLDAP 1000
1001 TNLQFVNETDSTVLVRWTPPRAQITGYRLTVGLTRRGQPRQYNVGPSVSK 1050
1051 YPLRNLQPASEYTVSLVAIKGNQESPKATGVFTTLQPGSSIPPYNTEVTE 1100
1101 TTIVITWTPAPRIGFKLGVRPSQGGEAPREVTSDSGSIVVSGLTPGVEYV 1150
1151 YTIQVLRDGQERDAPIVNKVVTPLSPPTNLHLEANPDTGVLTVSWERSTT 1200
1201 PDITGYRITTTPTNGQQGNSLEEVVHADQSSCTFDNLSPGLEYNVSVYTV 1250
1251 KDDKESVPISDTIIPEVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRI 1300
1301 TVVAAGEGIPIFEDFVDSSVGYYTVTGLEPGIDYDISVITLINGGESAPT 1350
1351 TLTQQTAVPPPTDLRFTNIGPDTMRVTWAPPPSIDLTNFLVRYSPVKNEE 1400
1401 DVAELSISPSDNAVVLTNLLPGTEYVVSVSSVYEQHESTPLRGRQKTGLD 1450
1451 SPTGIDFSDITANSFTVHWIAPRATITGYRIRHHPEHFSGRPREDRVPHS 1500
1501 RNSITLTNLTPGTEYVVSIVALNGREESPLLIGQQSTVSDVPRDLEVVAA 1550
1551 TPTSLLISWDAPAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLK 1600
1601 PGVDYTITVYAVTGRGDSPASSKPISINYRTEIDKPSQMQVTDVQDNSIS 1650
1651 VKWLPSSSPVTGYRVTTTPKNGPGPTKTKTAGPDQTEMTIEGLQPTVEYV 1700
1701 VSVYAQNPSGESQPLVQTAVTNIDRPKGLAFTDVDVDSIKIAWESPQGQV 1750
1751 SRYRVTYSSPEDGIHELFPAPDGEEDTAELQGLRPGSEYTVSVVALHDDM 1800
1801 ESQPLIGTQSTAIPAPTDLKFTQVTPTSLSAQWTPPNVQLTGYRVRVTPK 1850
1851 EKTGPMKEINLAPDSSSVVVSGLMVATKYEVSVYALKDTLTSRPAQGVVT 1900
1901 TLENVSPPRRARVTDATETTITISWRTKTETITGFQVDAVPANGQTPIQR 1950
1951 TIKPDVRSYTITGLQPGTDYKIYLYTLNDNARSSPVVIDASTAIDAPSNL 2000
2001 RFLATTPNSLLVSWQPPRARITGYIIKYEKPGSPPREVVPRPRPGVTEAT 2050
2051 ITGLEPGTEYTIYVIALKNNQKSEPLIGRKKTDELPQLVTLPHPNLHGPE 2100
2101 ILDVPSTVQKTPFVTHPGYDTGNGIQLPGTSGQQPSVGQQMIFEEHGFRR 2150
2151 TTPPTTATPIRHRPRPYPPNVGEEIQIGHIPREDVDYHLYPHGPGLNPNA 2200
2201 STGQEALSQTTISWAPFQDTSEYIISCHPVGTDEEPLQFRVPGTSTSATL 2250
2251 TGLTRGATYNVIVEALKDQQRHKVREEVVTVGNSVNEGLNQPTDDSCFDP 2300
2301 YTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHFRCDSSRWCHDNGVNYK 2350
2351 IGEKWDRQGENGQMMSCTCLGNGKGEFKCDPHEATCYDDGKTYHVGEQWQ 2400
2401 KEYLGAICSCTCFGGQRGWRCDNCRRPGGEPSPEGTTGQSYNQYSQRYHQ 2450
2451 RTNTNVNCPIECFMPLDVQADREDSRE 2477

Positively and negatively influencing subsequences are coloured according to the following scale:

(non-nuclear) negative ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| positive (nuclear)

with NucPred



If you find NucPred useful, please cite this paper:
NucPred - Predicting Nuclear Localization of Proteins. Brameier M, Krings A, Maccallum RM. Bioinformatics, 2007. PubMed id: 17332022
The authors also look forward to your comments and suggestions.

What does the NucPred score mean?

You have to decide on a NucPred score threshold. Sequences which score greater than or equal to this threshold are predicted to spend some time in the nucleus. Higher thresholds yield fewer predicted nuclear proteins, but these predictions are more accurate (you can have higher confidence in them). The table below gives more details of the performance of NucPred estimated using the sequences it was trained on (by cross-validation). Another benchmark is available in the Bioinformatics 2007 paper.

NucPred score threshold Specificity Sensitivity
see above fraction of proteins predicted to be nuclear that actually are nuclear fraction of true nuclear proteins that are predicted (coverage)
0.10 0.45 0.88
0.20 0.52 0.83
0.30 0.57 0.77
0.40 0.63 0.69
0.50 0.70 0.62
0.60 0.71 0.53
0.70 0.81 0.44
0.80 0.84 0.32
0.90 0.88 0.21
1.00 1.00 0.02

Sequences which score >= 0.8 with NucPred and which are predicted by PredictNLS to contain an NLS have been shown to be 93% correct with a coverage of 16%. (PredictNLS by itself is 87% correct with 26% coverage on the same data.)

Go back to the NucPred Home Page.