 | Authors: Amine Heddad, Andrea Krings, Markus Brameier and Bob MacCallum, Stockholm Bioinformatics Center, Stockholm University, Sweden. |
NucPred
Fetching Q5VST9 from www.uniprot.org...
The NucPred score for your sequence is 0.63 (see score help below)
1 MDQPQFSGAPRFLTRPKAFVVSVGKDATLSCQIVGNPTPQVSWEKDQQPV 50
51 AAGARFRLAQDGDLYRLTILDLALGDSGQYVCRARNAIGEAFAAVGLQVD 100
101 AEAACAEQAPHFLLRPTSIRVREGSEATFRCRVGGSPRPAVSWSKDGRRL 150
151 GEPDGPRVRVEELGEASALRIRAARPRDGGTYEVRAENPLGAASAAAALV 200
201 VDSDAADTASRPGTSTAALLAHLQRRREAMRAEGAPASPPSTGTRTCTVT 250
251 EGKHARLSCYVTGEPKPETVWKKDGQLVTEGRRHVVYEDAQENFVLKILF 300
301 CKQSDRGLYTCTASNLVGQTYSSVLVVVREPAVPFKKRLQDLEVREKESA 350
351 TFLCEVPQPSTEAAWFKEETRLWASAKYGIEEEGTERRLTVRNVSADDDA 400
401 VYICETPEGSRTVAELAVQGNLLRKLPRKTAVRVGDTAMFCVELAVPVGP 450
451 VHWLRNQEEVVAGGRVAISAEGTRHTLTISQCCLEDVGQVAFMAGDCQTS 500
501 TQFCVSAPRKPPLQPPVDPVVKARMESSVILSWSPPPHGERPVTIDGYLV 550
551 EKKKLGTYTWIRCHEAEWVATPELTVADVAEEGNFQFRVSALNSFGQSPY 600
601 LEFPGTVHLAPKLAVRTPLKAVQAVEGGEVTFSVDLTVASAGEWFLDGQA 650
651 LKASSVYEIHCDRTRHTLTIREVPASLHGAQLKFVANGIESSIRMEVRAA 700
701 PGLTANKPPAAAAREVLARLHEEAQLLAELSDQAAAVTWLKDGRTLSPGP 750
751 KYEVQASAGRRVLLVRDVARDDAGLYECVSRGGRIAYQLSVQGLARFLHK 800
801 DMAGSCVDAVAGGPAQFECETSEAHVHVHWYKDGMELGHSGERFLQEDVG 850
851 TRHRLVAATVTRQDEGTYSCRVGEDSVDFRLRVSEPKVVFAKEQLARRKL 900
901 QAEAGASATLSCEVAQAQTEVTWYKDGKKLSSSSKVCMEATGCTRRLVVQ 950
951 QAGQADAGEYSCEAGGQRLSFHLDVKEPKVVFAKDQVAHSEVQAEAGASA 1000
1001 TLSCEVAQAQTEVMWYKDGKKLSSSLKVHVEAKGCRRRLVVQQAGKTDAG 1050
1051 DYSCEARGQRVSFRLHITEPKMMFAKEQSVHNEVQAEAGASAMLSCEVAQ 1100
1101 AQTEVTWYKDGKKLSSSSKVGMEVKGCTRRLVLPQAGKADAGEYSCEAGG 1150
1151 QRVSFHLHITEPKGVFAKEQSVHNEVQAEAGTTAMLSCEVAQPQTEVTWY 1200
1201 KDGKKLSSSSKVRMEVKGCTRRLVVQQVGKADAGEYSCEAGGQRVSFQLH 1250
1251 ITEPKAVFAKEQLVHNEVRTEAGASATLSCEVAQAQTEVTWYKDGKKLSS 1300
1301 SSKVRIEAAGCMRQLVVQQAGQADAGEYTCEAGGQRLSFHLDVSEPKAVF 1350
1351 AKEQLAHRKVQAEAGAIATLSCEVAQAQTEVTWYKDGKKLSSSSKVRMEA 1400
1401 VGCTRRLVVQQACQADTGEYSCEAGGQRLSFSLDVAEPKVVFAKEQPVHR 1450
1451 EVQAQAGASTTLSCEVAQAQTEVMWYKDGKKLSFSSKVRMEAVGCTRRLV 1500
1501 VQQAGQAVAGEYSCEAGSQRLSFHLHVAEPKAVFAKEQPASREVQAEAGT 1550
1551 SATLSCEVAQAQTEVTWYKDGKKLSSSSKVRMEAVGCTRRLVVQEAGQAD 1600
1601 AGEYSCKAGDQRLSFHLHVAEPKVVFAKEQPAHREVQAEAGASATLSCEV 1650
1651 AQAQTEVTWYKDGKKLSSSSKVRVEAVGCTRRLVVQQAGQAEAGEYSCEA 1700
1701 GGQQLSFRLHVAELEPQISERPCRREPLVVKEHEDIILTATLATPSAATV 1750
1751 TWLKDGVEIRRSKRHETASQGDTHTLTVHGAQVLDSAIYSCRVGAEGQDF 1800
1801 PVQVEEVAAKFCRLLEPVCGELGGTVTLACELSPACAEVVWRCGNTQLRV 1850
1851 GKRFQMVAEGPVRSLTVLGLRAEDAGEYVCESRDDHTSAQLTVSVPRVVK 1900
1901 FMSGLSTVVAEEGGEATFQCVVSPSDVAVVWFRDGALLQPSEKFAISQSG 1950
1951 ASHSLTISDLVLEDAGQITVEAEGASSSAALRVREAPVLFKKKLEPQTVE 2000
2001 ERSSVTLEVELTRPWPELRWTRNATALAPGKNVEIHAEGARHRLVLHNVG 2050
2051 FADRGFFGCETPDDKTQAKLTVEMRQVRLVRGLQAVEAREQGTATMEVQL 2100
2101 SHADVDGSWTRDGLRFQQGPTCHLAVRGPMHTLTLSGLRPEDSGLMVFKA 2150
2151 EGVHTSARLVVTELPVSFSRPLQDVVTTEKEKVTLECELSRPNVDVRWLK 2200
2201 DGVELRAGKTMAIAAQGACRSLTIYRCEFADQGVYVCDAHDAQSSASVKV 2250
2251 QGRTYTLIYRRVLAEDAGEIQFVAENAESRAQLRVKELPVTLVRPLRDKI 2300
2301 AMEKHRGVLECQVSRASAQVRWFKGSQELQPGPKYELVSDGLYRKLIISD 2350
2351 VHAEDEDTYTCDAGDVKTSAQFFVEEQSITIVRGLQDVTVMEPAPAWFEC 2400
2401 ETSIPSVRPPKWLLGKTVLQAGGNVGLEQEGTVHRLMLRRTCSTMTGPVH 2450
2451 FTVGKSRSSARLVVSDIPVVLTRPLEPKTGRELQSVVLSCDFRPAPKAVQ 2500
2501 WYKDDTPLSPSEKFKMSLEGQMAELRILRLMPADAGVYRCQAGSAHSSTE 2550
2551 VTVEAREVTVTGPLQDAEATEEGWASFSCELSHEDEEVEWSLNGMPLYND 2600
2601 SFHEISHKGRRHTLVLKSIQRADAGIVRASSLKVSTSARLEVRVKPVVFL 2650
2651 KALDDLSAEERGTLALQCEVSDPEAHVVWRKDGVQLGPSDKYDFLHTAGT 2700
2701 RGLVVHDVSPEDAGLYTCHVGSEETRARVRVHDLHVGITKRLKTMEVLEG 2750
2751 ESCSFECVLSHESASDPAMWTVGGKTVGSSSRFQATRQGRKYILVVREAA 2800
2801 PSDAGEVVFSVRGLTSKASLIVRERPAAIIKPLEDQWVAPGEDVELRCEL 2850
2851 SRAGTPVHWLKDRKAIRKSQKYDVVCEGTMAMLVIRGASLKDAGEYTCEV 2900
2901 EASKSTASLHVEEKANCFTEELTNLQVEEKGTAVFTCKTEHPAATVTWRK 2950
2951 GLLELRASGKHQPSQEGLTLRLTISALEKADSDTYTCDIGQAQSRAQLLV 3000
3001 QGRRVHIIEDLEDVDVQEGSSATFRCRISPANYEPVHWFLDKTPLHANEL 3050
3051 NEIDAQPGGYHVLTLRQLALKDSGTIYFEAGDQRASAALRVTEKPSVFSR 3100
3101 ELTDATITEGEDLTLVCETSTCDIPVCWTKDGKTLRGSARCQLSHEGHRA 3150
3151 QLLITGATLQDSGRYKCEAGGACSSSIVRVHARPVRFQEALKDLEVLEGG 3200
3201 AATLRCVLSSVAAPVKWCYGNNVLRPGDKYSLRQEGAMLELVVRNLRPQD 3250
3251 SGRYSCSFGDQTTSATLTVTALPAQFIGKLRNKEATEGATATLRCELSKA 3300
3301 APVEWRKGSETLRDGDRYCLRQDGAMCELQIRGLAMVDAAEYSCVCGEER 3350
3351 TSASLTIRPMPAHFIGRLRHQESIEGATATLRCELSKAAPVEWRKGRESL 3400
3401 RDGDRHSLRQDGAVCELQICGLAVADAGEYSCVCGEERTSATLTVKALPA 3450
3451 KFTEGLRNEEAVEGATAMLWCELSKVAPVEWRKGPENLRDGDRYILRQEG 3500
3501 TRCELQICGLAMADAGEYLCVCGQERTSATLTIRALPARFIEDVKNQEAR 3550
3551 EGATAVLQCELNSAAPVEWRKGSETLRDGDRYSLRQDGTKCELQIRGLAM 3600
3601 ADTGEYSCVCGQERTSAMLTVRALPIKFTEGLRNEEATEGATAVLRCELS 3650
3651 KMAPVEWWKGHETLRDGDRHSLRQDGARCELQIRGLVAEDAGEYLCMCGK 3700
3701 ERTSAMLTVRAMPSKFIEGLRNEEATEGDTATLWCELSKAAPVEWRKGHE 3750
3751 TLRDGDRHSLRQDGSRCELQIRGLAVVDAGEYSCVCGQERTSATLTVRAL 3800
3801 PARFIEDVKNQEAREGATAVLQCELSKAAPVEWRKGSETLRGGDRYSLRQ 3850
3851 DGTRCELQIHGLSVADTGEYSCVCGQERTSATLTVKAPQPVFREPLQSLQ 3900
3901 AEEGSTATLQCELSEPTATVVWSKGGLQLQANGRREPRLQGCTAELVLQD 3950
3951 LQREDTGEYTCTCGSQATSATLTVTAAPVRFLRELQHQEVDEGGTAHLCC 4000
4001 ELSRAGASVEWRKGSLQLFPCAKYQMVQDGAAAELLVRGVEQEDAGDYTC 4050
4051 DTGHTQSMASLSVRVPRPKFKTRLQSLEQETGDIARLCCQLSDAESGAVV 4100
4101 QWLKEGVELHAGPKYEMRSQGATRELLIHQLEAKDTGEYACVTGGQKTAA 4150
4151 SLRVTEPEVTIVRGLVDAEVTADEDVEFSCEVSRAGATGVQWCLQGLPLQ 4200
4201 SNEVTEVAVRDGRIHTLRLKGVTPEDAGTVSFHLGNHASSAQLTVRAPEV 4250
4251 TILEPLQDVQLSEGQDASFQCRLSRASGQEARWALGGVPLQANEMNDITV 4300
4301 EQGTLHLLTLHKVTLEDAGTVSFHVGTCSSEAQLKVTAKNTVVRGLENVE 4350
4351 ALEGGEALFECQLSQPEVAAHTWLLDDEPVHTSENAEVVFFENGLRHLLL 4400
4401 LKNLRPQDSCRVTFLAGDMVTSAFLTVRGWRLEILEPLKNAAVRAGAQAC 4450
4451 FTCTLSEAVPVGEASWYINGAAVQPDDSDWTVTADGSHHALLLRSAQPHH 4500
4501 AGEVTFACRDAVASARLTVLGLPDPPEDAEVVARSSHTVTLSWAAPMSDG 4550
4551 GGGLCGYRVEVKEGATGQWRLCHELVPGPECVVDGLAPGETYRFRVAAVG 4600
4601 PVGAGEPVHLPQTVRLAEPPKPVPPQPSAPESRQVAAGEDVSLELEVVAE 4650
4651 AGEVIWHKGMERIQPGGRFEVVSQGRQQMLVIKGFTAEDQGEYHCGLAQG 4700
4701 SICPAAATFQVALSPASVDEAPQPSLPPEAAQEGDLHLLWEALARKRRMS 4750
4751 REPTLDSISELPEEDGRSQRLPQEAEEVAPDLSEGYSTADELARTGDADL 4800
4801 SHTSSDDESRAGTPSLVTYLKKAGRPGTSPLASKVGAPAAPSVKPQQQQE 4850
4851 PLAAVRPPLGDLSTKDLGDPSMDKAAVKIQAAFKGYKVRKEMKQQEGPMF 4900
4901 SHTFGDTEAQVGDALRLECVVASKADVRARWLKDGVELTDGRHHHIDQLG 4950
4951 DGTCSLLITGLDRADAGCYTCQVSNKFGQVTHSACVVVSGSESEAESSSG 5000
5001 GELDDAFRRAARRLHRLFRTKSPAEVSDEELFLSADEGPAEPEEPADWQT 5050
5051 YREDEHFICIRFEALTEARQAVTRFQEMFATLGIGVEIKLVEQGPRRVEM 5100
5101 CISKETPAPVVPPEPLPSLLTSDAAPVFLTELQNQEVQDGYPVSFDCVVT 5150
5151 GQPMPSVRWFKDGKLLEEDDHYMINEDQQGGHQLIITAVVPADMGVYRCL 5200
5201 AENSMGVSSTKAELRVDLTSTDYDTAADATESSSYFSAQGYLSSREQEGT 5250
5251 ESTTDEGQLPQVVEELRDLQVAPGTRLAKFQLKVKGYPAPRLYWFKDGQP 5300
5301 LTASAHIRMTDKKILHTLEIISVTREDSGQYAAYISNAMGAAYSSARLLV 5350
5351 RGPDEPEEKPASDVHEQLVPPRMLERFTPKKVKKGSSITFSVKVEGRPVP 5400
5401 TVHWLREEAERGVLWIGPDTPGYTVASSAQQHSLVLLDVGRQHQGTYTCI 5450
5451 ASNAAGQALCSASLHVSGLPKVEEQEKVKEALISTFLQGTTQAISAQGLE 5500
5501 TASFADLGGQRKEEPLAAKEALGHLSLAEVGTEEFLQKLTSQITEMVSAK 5550
5551 ITQAKLQVPGGDSDEDSKTPSASPRHGRSRPSSSIQESSSESEDGDARGE 5600
5601 IFDIYVVTADYLPLGAEQDAITLREGQYVEVLDAAHPLRWLVRTKPTKSS 5650
5651 PSRQGWVSPAYLDRRLKLSPEWGAAEAPEFPGEAVSEDEYKARLSSVIQE 5700
5701 LLSSEQAFVEELQFLQSHHLQHLERCPHVPIAVAGQKAVIFRNVRDIGRF 5750
5751 HSSFLQELQQCDTDDDVAMCFIKNQAAFEQYLEFLVGRVQAESVVVSTAI 5800
5801 QEFYKKYAEEALLAGDPSQPPPPPLQHYLEQPVERVQRYQALLKELIRNK 5850
5851 ARNRQNCALLEQAYAVVSALPQRAENKLHVSLMENYPGTLQALGEPIRQG 5900
5901 HFIVWEGAPGARMPWKGHNRHVFLFRNHLVICKPRRDSRTDTVSYVFRNM 5950
5951 MKLSSIDLNDQVEGDDRAFEVWQEREDSVRKYLLQARTAIIKSSWVKEIC 6000
6001 GIQQRLALPVWRPPDFEEELADCTAELGETVKLACRVTGTPKPVISWYKD 6050
6051 GKAVQVDPHHILIEDPDGSCALILDSLTGVDSGQYMCFAASAAGNCSTLG 6100
6101 KILVQVPPRFVNKVRASPFVEGEDAQFTCTIEGAPYPQIRWYKDGALLTT 6150
6151 GNKFQTLSEPRSGLLVLVIRAASKEDLGLYECELVNRLGSARASAELRIQ 6200
6201 SPMLQAQEQCHREQLVAAVEDTTLERADQEVTSVLKRLLGPKAPGPSTGD 6250
6251 LTGPGPCPRGAPALQETGSQPPVTGTSEAPAVPPRVPQPLLHEGPEQEPE 6300
6301 AIARAQEWTVPIRMEGAAWPGAGTGELLWDVHSHVVRETTQRTYTYQAID 6350
6351 THTARPPSMQVTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDS 6400
6401 TRLSQQQEGTTYSLVLRHVASKDAGVYTCLAQNTGGQVLCKAELLVLGGD 6450
6451 NEPDSEKQSHRRKLHSFYEVKEEIGRGVFGFVKRVQHKGNKILCAAKFIP 6500
6501 LRSRTRAQAYRERDILAALSHPLVTGLLDQFETRKTLILILELCSSEELL 6550
6551 DRLYRKGVVTEAEVKVYIQQLVEGLHYLHSHGVLHLDIKPSNILMVHPAR 6600
6601 EDIKICDFGFAQNITPAELQFSQYGSPEFVSPEIIQQNPVSEASDIWAMG 6650
6651 VISYLSLTCSSPFAGESDRATLLNVLEGRVSWSSPMAAHLSEDAKDFIKA 6700
6701 TLQRAPQARPSAAQCLSHPWFLKSMPAEEAHFINTKQLKFLLARSRWQRS 6750
6751 LMSYKSILVMRSIPELLRGPPDSPSLGVARHLCRDTGGSSSSSSSSDNEL 6800
6801 APFARAKSLPPSPVTHSPLLHPRGFLRPSASLPEEAEASERSTEAPAPPA 6850
6851 SPEGAGPPAAQGCVPRHSVIRSLFYHQAGESPEHGALAPGSRRHPARRRH 6900
6901 LLKGGYIAGALPGLREPLMEHRVLEEEAAREEQATLLAKAPSFETALRLP 6950
6951 ASGTHLAPGHSHSLEHDSPSTPRPSSEACGEAQRLPSAPSGGAPIRDMGH 7000
7001 PQGSKQLPSTGGHPGTAQPERPSPDSPWGQPAPFCHPKQGSAPQEGCSPH 7050
7051 PAVAPCPPGSFPPGSCKEAPLVPSSPFLGQPQAPPAPAKASPPLDSKMGP 7100
7101 GDISLPGRPKPGPCSSPGSASQASSSQVSSLRVGSSQVGTEPGPSLDAEG 7150
7151 WTQEAEDLSDSTPTLQRPQEQATMRKFSLGGRGGYAGVAGYGTFAFGGDA 7200
7201 GGMLGQGPMWARIAWAVSQSEEEEQEEARAESQSEEQQEARAESPLPQVS 7250
7251 ARPVPEVGRAPTRSSPEPTPWEDIGQVSLVQIRDLSGDAEAADTISLDIS 7300
7301 EVDPAYLNLSDLYDIKYLPFEFMIFRKVPKSAQPEPPSPMAEEELAEFPE 7350
7351 PTWPWPGELGPHAGLEITEESEDVDALLAEAAVGRKRKWSSPSRSLFHFP 7400
7401 GRHLPLDEPAELGLRERVKASVEHISRILKGRPEGLEKEGPPRKKPGLAS 7450
7451 FRLSGLKSWDRAPTFLRELSDETVVLGQSVTLACQVSAQPAAQATWSKDG 7500
7501 APLESSSRVLISATLKNFQLLTILVVVAEDLGVYTCSVSNALGTVTTTGV 7550
7551 LRKAERPSSSPCPDIGEVYADGVLLVWKPVESYGPVTYIVQCSLEGGSWT 7600
7601 TLASDIFDCCYLTSKLSRGGTYTFRTACVSKAGMGPYSSPSEQVLLGGPS 7650
7651 HLASEEESQGRSAQPLPSTKTFAFQTQIQRGRFSVVRQCWEKASGRALAA 7700
7701 KIIPYHPKDKTAVLREYEALKGLRHPHLAQLHAAYLSPRHLVLILELCSG 7750
7751 PELLPCLAERASYSESEVKDYLWQMLSATQYLHNQHILHLDLRSENMIIT 7800
7801 EYNLLKVVDLGNAQSLSQEKVLPSDKFKDYLETMAPELLEGQGAVPQTDI 7850
7851 WAIGVTAFIMLSAEYPVSSEGARDLQRGLRKGLVRLSRCYAGLSGGAVAF 7900
7901 LRSTLCAQPWGRPCASSCLQCPWLTEEGPACSRPAPVTFPTARLRVFVRN 7950
7951 REKRRALLYKRHNLAQVR 7968
Positively and negatively influencing subsequences are coloured according to the following scale:
(non-nuclear) negative ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| positive (nuclear)
What does the NucPred score mean?
| You have to decide on a NucPred score threshold. Sequences which score greater than or equal to this threshold are predicted to spend some time in the nucleus. Higher thresholds yield fewer predicted nuclear proteins, but these predictions are more accurate (you can have higher confidence in them). The table below gives more details of the performance of NucPred estimated using the sequences it was trained on (by cross-validation). Another benchmark is available in the Bioinformatics 2007 paper. |
| NucPred score threshold | Specificity | Sensitivity |
| see above | fraction of proteins predicted to be nuclear that actually are nuclear | fraction of true nuclear proteins that are predicted (coverage) |
| 0.10 | 0.45 | 0.88 |
| 0.20 | 0.52 | 0.83 |
| 0.30 | 0.57 | 0.77 |
| 0.40 | 0.63 | 0.69 |
| 0.50 | 0.70 | 0.62 |
| 0.60 | 0.71 | 0.53 |
| 0.70 | 0.81 | 0.44 |
| 0.80 | 0.84 | 0.32 |
| 0.90 | 0.88 | 0.21 |
| 1.00 | 1.00 | 0.02 |
| Sequences which score >= 0.8 with NucPred and which
are predicted by PredictNLS to contain an NLS have been shown to be 93% correct with a coverage of 16%. (PredictNLS by itself is 87% correct with 26% coverage on the same data.) |
Go back to the NucPred Home Page.