Authors: Amine Heddad, Andrea Krings, Markus Brameier and Bob MacCallum, Stockholm Bioinformatics Center, Stockholm University, Sweden.
NucPred
Fetching Q8WXH0 from www.uniprot.org...
The NucPred score for your sequence is 0.96 (see score help below)
1 MASSPELPTEDEQGSWGIDDLHISLQAEQEDTQKKAFTCWINSQLARHTS 50
51 PSVISDLFTDIKKGHVLLDLLEVLSGQQLPRDKGSNTFQCRINIEHALTF 100
101 LRNRSIKLINIHVTDIIDGNPSIILGLIWTIILHFHIEKLAQTLSCNYNQ 150
151 PSLDDVSVVDSSPASSPPAKKCSKVQARWQMSARKALLLWAQEQCATYES 200
201 VNVTDFKSSWRNGMAFLAIIHALRPDLIDMKSVKHRSNKDNLREAFRIAE 250
251 QELKIPRLLEPEDVDVVDPDEKSIMTYVAQFLQYSKDAPGTGEEAQGKVK 300
301 DAMGWLTLQKEKLQKLLKDSENDTYFKKYNSLLSFMESFNEEKKSFLDVL 350
351 SIKRDLDELDKDHLQLREAWDGLDHQINAWKIKLNYALPPPLHQTEAWLQ 400
401 EVEELMDEDLSASQDHSQAVTLIQEKMTLFKSLMDRFEHHSNILLTFENK 450
451 DENHLPLVPPNKLEEMKRRINNILEKKFILLLEFHYYKCLVLGLVDEVKS 500
501 KLDIWNIKYGSRESVELLLEDWHKFIEEKEFLARLDTSFQKCGEIYKNLA 550
551 GECQNINKQYMMVKSDVCMYRKNIYNVKSTLQKVLACWATYVENLRLLRA 600
601 CFEETKKEEIKEVPFETLAQWNLEHATLNEAGNFLVEVSNDVVGSSISKE 650
651 LRRLNKRWRKLVSKTQLEMNLPLMIKKQDQPTFDNSGNILSKEEKATVEF 700
701 STDMSVELPENYNQNIKAGEKHEKENEEFTGQLKVAKDVEKLIGQVEIWE 750
751 AEAKSVLDQDDVDTSMEESLKHLIAKGSMFDELMARSEDMLQMDIQNISS 800
801 QESFQHVLTTGLQAKIQEAKEKVQINVVKLIAALKNLTDVSPDLDIRLKM 850
851 EESQKELESYMMRAQQLLGQRESPGELISKHKEALIISNTKSLAKYLKAV 900
901 EELKNNVTEDIKMSLEEKSRDVCAKWESLHHELSLYVQQLKIDIEKGKLS 950
951 DNILKLEKQINKEKKLIRRGRTKGLIKEHEACFSEEGCLYQLNHHMEVLR 1000
1001 ELCEELPSQKSQQEVKRLLKDYEQKIERLLKCASEIHMTLQPTAGGTSKN 1050
1051 EGTITTSENRGGDPHSEAPFAKSDNQPSTEKAMEPTMKFSLASVLRPLQE 1100
1101 ESIMEKDYSASINSLLERYDTYRDILEHHLQNNKFRITSDFSSEEDRSSS 1150
1151 CLQAKLTDLQVIKNETDARWKEFEIISLKLENHVNDIKKPFVIKERDTLK 1200
1201 ERERELQMTLNTRMESLETALRLVLPVEKASLLLCGSDLPLHKMAIQGFH 1250
1251 LIDADRIYQHLRNIQDSIAKQIEICNRLEEPGNFVLKELHPFDLHAMQNI 1300
1301 ILKYKTQFEGMNHRVQRSEDTLKALEDFLASLRTAKLSAEPVTDLSASDT 1350
1351 QVAQENTLTVKNKEGEIHLMKDKAKHLDKCLKMLDMSFKDAERGDDTSCE 1400
1401 NLLDAFSIKLSETHGYGVQEEFTEENKLLEACIFKNNELLKNIQDVQSQI 1450
1451 SKIGLKDPTVPAVKHRKKSLIRLDKVLDEYEEEKRHLQEMANSLPHFKDG 1500
1501 REKTVNQQCQNTVVLWENTKALVTECLEQCGRVLELLKQYQNFKSILTTL 1550
1551 IQKEESVISLQASYMGKENLKKRIAEIEIVKEEFNEHLEVVDKINQVCKN 1600
1601 LQFYLNKMKTFEEPPFEKEANIIVDRWLDINEKTEDYYENLGRALALWDK 1650
1651 LFNLKNVIDEWTEKALQKMELHQLTEEDRERLKEELQVHEQKTSEFSRRV 1700
1701 AEIQFLLQSSEIPLELQVMESSILNKMEHVQKCLTGESNCHALSGSTAEL 1750
1751 REDLDQAKTQIGMTESLLKALSPSDSLEIFTKLEEIQQQILQQKHSMILL 1800
1801 ENQIGCLTPELSELKKQYESVSDLFNTKKSVLQDHFSKLLNDQCKNFNDW 1850
1851 FSNIKVNLKECFESSETKKSVEQKLQKLSDFLTLEGRNSKIKQVDSVLKH 1900
1901 VKKHLPKAHVKELISWLVGQEFELEKMESICQARAKELEDSLQQLLRLQD 1950
1951 DHRNLRKWLTNQEEKWKGMEEPGEKTELFCQALARKREQFESVAQLNNSL 2000
2001 KEYGFTEEEEIIMEATCLMDRYQTLLRQLSEIEEEDKLLPTEDQSFNDLA 2050
2051 HDVIHWIKEIKESLMVLNSSEGKMPLEERIQKIKEIILLKPEGDARIETI 2100
2101 MKQAESSEAPLVQKTLTDISNQWDNTLHLASTYLSHQEKLLLEGEKYLQS 2150
2151 KEDLRLMLIELKKKQEAGFALQHGLQEKKAQLKIYKKFLKKAQDLTSLLK 2200
2201 ELKSQGNYLLECTKNPSFSEEPWLEIKHLHESLLQQLQDSVQNLDGHVRE 2250
2251 HDSYQVCVTDLNTTLDNFSKEFVSFSDKPVDQIAVEEKLQKLQELENRLS 2300
2301 LQDGTLKKILALAKSVKQNTSSVGQKIIKDDIKSLQCKQKDLENRLASAK 2350
2351 QEMECCLNSILKSKRSTEKKGKFTLPGREKQATSDVQESTQESAAVEKLE 2400
2401 EDWEINKDSAVEMAMSKQLSLNAQESMKNTEDERKVNELQNQPLELDTML 2450
2451 RNEQLEEIEKLYTQLEAKKAAIKPLEQTECLNKTETGALVLHNIGYSAQH 2500
2501 LDNLLQALITLKKNKESQYCVLRDFQEYLAAVESSMKALLTDKESLKVGP 2550
2551 LDSVTYLDKIKKFIASIEKEKDSLGNLKIKWENLSNHVTDMDKKLLESQI 2600
2601 KQLEHGWEQVEQQIQKKYSQQVVEYDEFTTLMNKVQDTEISLQQQQQHLQ 2650
2651 LRLKSPEERAGNQSMIALTTDLQATKHGFSVLKGQAELQMKRIWGEKEKK 2700
2701 NLEDGINNLKKQWETLEPLHLEAENQIKKCDIRNKMKETILWAKNLLGEL 2750
2751 NPSIPLLPDDILSQIRKCKVTHDGILARQQSVESLAEEVKDKVPSLTTYE 2800
2801 GSDLNNTLEDLRNQYQMLVLKSTQRSQQLEFKLEERSNFFAIIRKFQLMV 2850
2851 QESETLIIPRVETAATEAELKHHHVTLEASQKELQEIDSGISTHLQELTN 2900
2901 IYEELNVFERLFLEDQLKNLKIRTNRIQRFIQNTCNEVEHKIKFCRQFHE 2950
2951 KTSALQEEADSIQRNELLLNQEVNKGVKEEIYNLKDRLTAIKCCILQVLK 3000
3001 LKKVFDYIGLNWDFSQLDQLQTQVFEKEKELEEKIKQLDTFEEEHGKYQA 3050
3051 LLSKMRAIDLQIKKMTEVVLKAPDSSPESRRLNAQILSQRIEKAKCLCDE 3100
3101 IIKKLNENKTFDDSFKEKEILQIKLNAEENDKLYKVLQNMVLELSPKELD 3150
3151 EKNCQDKLETSLHVLNQIKSQLQQPLLINLEIKHIQNEKDNCEAFQEQVW 3200
3201 AEMCSIKAVTAIEKQREENSSEASDVETKLREFEDLQMQLNTSIDLRTNV 3250
3251 LNDAYENLTRYKEAVTRAVESITSLEAIIIPYRVDVGNPEESLEMPLRKQ 3300
3301 EELESTVAHIQDLTEKLGMISSPEAKLQLQYTLQELVSKNSAMKEAFKAQ 3350
3351 ETEAERYLENYKCYRKMEEDIYTNLSKMETVLGQSMSSLPLSYREALERL 3400
3401 EQSKALVSNLISTKEELMKLRQILRLLRLRCTENDGICLLKIVSALWEKW 3450
3451 LSLLEAAKEWEMWCEELKQEWKFVSEEIEREAIILDNLQEELPEISKTKE 3500
3501 AATTEELSELLDCLCQYGENVEKQQLLLTLLLQRIRSIQNVPESSGAVET 3550
3551 VPAFQEITSMKERCNKLLQKVQKNKELVQTEIQERHSFTKEIIALKNFFQ 3600
3601 QTTTSFQNMAFQDHPEKSEQFEELQSILKKGKLTFENIMEKLRIKYSEMY 3650
3651 TIVPAEIESQVEECRKALEDIDEKISNEVLKSSPSYAMRRKIEEINNGLH 3700
3701 NVEKMLQQKSKNIEKAQEIQKKMWDELDLWHSKLNELDSEVQDIVEQDPG 3750
3751 QAQEWMDNLMIPFQQYQQVSQRAECRTSQLNKATVKMEEYSDLLKSTEAW 3800
3801 IENTSHLLANPADYDSLRTLSHHASTVQMALEDSEQKHNLLHSIFMDLED 3850
3851 LSIIFETDELTQSIQELSNQVTALQQKIMESLPQIQRMADDVVAIESEVK 3900
3901 SMEKRVSKIKTILLSKEIFDFSPEEHLKHGEVILENIRPMKKTIAEIVSY 3950
3951 QVELRLPQTGMKPLPVFQRTNQLLQDIKLLENVTQEQNELLKVVIKQTNE 4000
4001 WDEEIENLKQILNNYSAQFSLEHMSPDQADKLPQLQGEIERMEKQILSLN 4050
4051 QRKEDLLVDLKATVLNLHQHLKQEQEGVERDRLPAVTSEEGGVAERDASE 4100
4101 RKLNRRGSMSYLAAVEEEVEESSVKSDNGDEKAEPSPQSWSSLWKHDKDM 4150
4151 EEDRASSSSGTIVQEAYGKISTSDNSMAQILTPDSLNTEQGPECSLRPNQ 4200
4201 TEEGTTPPIEADTLDSSDAQGGLEPRVEKTRPEPTEVLHACKTQVAELEL 4250
4251 WLQQANVAVEPETLNADMQQVLEQQLVGCQAMLTEIEHKVAFLLETCKDQ 4300
4301 GLGDNGATQHEAEALSLKLKTVKCNLEKVQMMLQEKHSEDQHPTILKKSS 4350
4351 EPEHQEALQPVNLSELESIVTERPQFSRQKDFQQQQVLELKPMEQKDFIK 4400
4401 FIEFNAKKMWPQYCQHDNDTTQESSASNQASSPENDVPDSILSPQGQNGD 4450
4451 KWQYLHHELSSKIKLPLPQLVEPQVSTNMGILPSVTMYNFRYPTTEELKT 4500
4501 YTTQLEDLRQEASNLQTQENMTEEAYINLDKKLFELFLTLSQCLSSVEEM 4550
4551 LEMPRLYREDGSGQQVHYETLALELKKLYLALSDKKGDLLKAMTWPGENT 4600
4601 NLLLECFDNLQVCLEHTQAAAVCRSKSLKAGLDYNRSYQNEIKRLYHQLI 4650
4651 KSKTSLQQSLNEISGQSVAEQLQKADAYTVELENAESRVAKLRDEGERLH 4700
4701 LPYALLQEVYKLEDVLDSMWGMLRARYTELSSPFVTESQQDALLQGMVEL 4750
4751 VKIGKEKLAHGHLKQTKSKVALQAQIENHKVFFQKLVADMLLIQAYSAKI 4800
4801 LPSLLQNRETFWAEQVTEVKILEEKSRQCGMKLQSLLQKWEEFDENYASL 4850
4851 EKDLEILISTLPSVSLVEETEERLVERISFYQQIKRNIGGKHARLYQTLN 4900
4901 EGKQLVASVSCPELEGQIAKLEEQWLSLNKKIDHELHRLQALLKHLLSYN 4950
4951 RDSDQLTKWLESSQHTLNYWKEQSLNVSQDLDTIRSNINNFFEFSKEVDE 5000
5001 KSSLKTAVISIGNQLLHLKETDTATLRASLAQFEQKWTMLITQLPDIQEK 5050
5051 LHQLQMEKLPSRKAITEMISWMNNVEHQTSDEDSVHSPSSASQVKHLLQK 5100
5101 HKEFRMEMDYKQWIVDFVNQSLLQLSTCDVESKRYERTEFAEHLGEMNRQ 5150
5151 WHRVHGMLNRKIQHLEQLLESITESENKIQILNNWLEAQEERLKTLQKPE 5200
5201 SVISVQKLLLDCQDIENQLAIKSKALDELKQSYLTLESGAVPLLEDTASR 5250
5251 IDELFQKRSSVLTQVNQLKTSMQSVLQEWKIYDQLYDEVNMMTIRFWYCM 5300
5301 EHSKPVVLSLETLRCQVENLQSLQDEAESSEGSWEKLQEVIGKLKGLCPS 5350
5351 VAEIIEEKCQNTHKRWTQVNQAIADQLQKAQSLLQLWKAYSNAHGEAAAR 5400
5401 LKQQEAKFQQLANISMSGNNLAEILPPALQDIKELQHDVQKTKEAFLQNS 5450
5451 SVLDRLPQPAESSTHMLLPGPLHSLQRAAYLEKMLLVKANEFEFVLSQFK 5500
5501 DFGVRLESLKGLIMHEEENLDRLHQQEKENPDSFLNHVLALTAQSPDIEH 5550
5551 LNEVSLKLPLSDVAVKTLQNMNRQWIRATATALERCSELQGIGLNEKFLY 5600
5601 CCEKWIQLLEKIEEALKVDVANSLPELLEQQKTYKMLEAEVSINQTIADS 5650
5651 YVTQSLQLLDTTEIENRPEFITEFSKLTDRWQNAVQGVRQRKGDVDGLVR 5700
5701 QWQDFTTSVENLFRFLTDTSHLLSAVKGQERFSLYQTRSLIHELKNKEIH 5750
5751 FQRRRTTCALTLEAGEKLLLTTDLKTKESVGRRISQLQDSWKDMEPQLAE 5800
5801 MIKQFQSTVETWDQCEKKIKELKSRLQVLKAQSEDPLPELHEDLHNEKEL 5850
5851 IKELEQSLASWTQNLKELQTMKADLTRHVLVEDVMVLKEQIEHLHRQWED 5900
5901 LCLRVAIRKQEIEDRLNTWVVFNEKNKELCAWLVQMENKVLQTADISIEE 5950
5951 MIEKLQKDCMEEINLFSENKLQLKQMGDQLIKASNKSRAAEIDDKLNKIN 6000
6001 DRWQHLFDVIGSRVKKLKETFAFIQQLDKNMSNLRTWLARIESELSKPVV 6050
6051 YDVCDDQEIQKRLAEQQDLQRDIEQHSAGVESVFNICDVLLHDSDACANE 6100
6101 TECDSIQQTTRSLDRRWRNICAMSMERRMKIEETWRLWQKFLDDYSRFED 6150
6151 WLKSAERTAACPNSSEVLYTSAKEELKRFEAFQRQIHERLTQLELINKQY 6200
6201 RRLARENRTDTASRLKQMVHEGNQRWDNLQRRVTAVLRRLRHFTNQREEF 6250
6251 EGTRESILVWLTEMDLQLTNVEHFSESDADDKMRQLNGFQQEITLNTNKI 6300
6301 DQLIVFGEQLIQKSEPLDAVLIEDELEELHRYCQEVFGRVSRFHRRLTSC 6350
6351 TPGLEDEKEASENETDMEDPREIQTDSWRKRGESEEPSSPQSLCHLVAPG 6400
6401 HERSGCETPVSVDSIPLEWDHTGDVGGSSSHEEDEEGPYYSALSGKSISD 6450
6451 GHSWHVPDSPSCPEHHYKQMEGDRNVPPVPPASSTPYKPPYGKLLLPPGT 6500
6501 DGGKEGPRVLNGNPQQEDGGLAGITEQQSGAFDRWEMIQAQELHNKLKIK 6550
6551 QNLQQLNSDISAITTWLKKTEAELEMLKMAKPPSDIQEIELRVKRLQEIL 6600
6601 KAFDTYKALVVSVNVSSKEFLQTESPESTELQSRLRQLSLLWEAAQGAVD 6650
6651 SWRGGLRQSLMQCQDFHQLSQNLLLWLASAKNRRQKAHVTDPKADPRALL 6700
6701 ECRRELMQLEKELVERQPQVDMLQEISNSLLIKGHGEDCIEAEEKVHVIE 6750
6751 KKLKQLREQVSQDLMALQGTQNPASPLPSFDEVDSGDQPPATSVPAPRAK 6800
6801 QFRAVRTTEGEEETESRVPGSTRPQRSFLSRVVRAALPLQLLLLLLLLLA 6850
6851 CLLPSSEEDYSCTQANNFARSFYPMLRYTNGPPPT 6885
Positively and negatively influencing subsequences are coloured on a scale running from negative (non-nuclear) to positive (nuclear).
What does the NucPred score mean?
You have to decide on a NucPred score threshold. Sequences that score greater than or equal to this threshold are predicted to spend some time in the nucleus. Higher thresholds yield fewer predicted nuclear proteins, but those predictions are more accurate (you can have higher confidence in them). The table below gives the performance of NucPred at each threshold, estimated by cross-validation on the sequences it was trained on. Another benchmark is available in the Bioinformatics 2007 paper.
NucPred score threshold | Specificity (fraction of proteins predicted to be nuclear that actually are nuclear) | Sensitivity (fraction of true nuclear proteins that are predicted; coverage)
0.10 | 0.45 | 0.88
0.20 | 0.52 | 0.83
0.30 | 0.57 | 0.77
0.40 | 0.63 | 0.69
0.50 | 0.70 | 0.62
0.60 | 0.71 | 0.53
0.70 | 0.81 | 0.44
0.80 | 0.84 | 0.32
0.90 | 0.88 | 0.21
1.00 | 1.00 | 0.02
Sequences that score >= 0.8 with NucPred and that PredictNLS predicts to contain an NLS have been shown to be 93% correct with a coverage of 16%. (PredictNLS by itself is 87% correct with 26% coverage on the same data.)
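As a rough illustration of how the threshold table can be read for a given score, the sketch below looks up the highest tabulated threshold that a score still meets and returns the cross-validation figures for it. The function names (`performance_at`, `is_predicted_nuclear`, `meets_combined_rule`) are hypothetical and not part of NucPred; the PredictNLS result is assumed to be supplied by the user as a boolean from a separate run.

```python
from bisect import bisect_right

# (threshold, specificity, sensitivity), copied from the table above.
PERFORMANCE = [
    (0.10, 0.45, 0.88),
    (0.20, 0.52, 0.83),
    (0.30, 0.57, 0.77),
    (0.40, 0.63, 0.69),
    (0.50, 0.70, 0.62),
    (0.60, 0.71, 0.53),
    (0.70, 0.81, 0.44),
    (0.80, 0.84, 0.32),
    (0.90, 0.88, 0.21),
    (1.00, 1.00, 0.02),
]
THRESHOLDS = [row[0] for row in PERFORMANCE]


def performance_at(score):
    """Return the row for the highest tabulated threshold that `score` meets.

    Returns None when the score is below the lowest tabulated threshold.
    """
    idx = bisect_right(THRESHOLDS, score)
    return PERFORMANCE[idx - 1] if idx else None


def is_predicted_nuclear(score, threshold):
    """A score at or above the chosen threshold counts as a nuclear prediction."""
    return score >= threshold


def meets_combined_rule(score, has_predicted_nls):
    """True when the score is >= 0.8 and PredictNLS (run separately) reports an NLS.

    `has_predicted_nls` is an assumed input; this sketch does not call PredictNLS.
    """
    return score >= 0.8 and has_predicted_nls


# The score reported on this page for Q8WXH0 is 0.96.
threshold, specificity, sensitivity = performance_at(0.96)
print(f"threshold {threshold:.2f}: specificity {specificity:.2f}, sensitivity {sensitivity:.2f}")
```

For the score of 0.96 reported here, the lookup lands on the 0.90 row, since 0.96 does not reach the 1.00 threshold. The choice of threshold remains with the user, as the text above says; the lookup only reports what the table states for that row.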