SBC logo Authors: Amine Heddad, Andrea Krings, Markus Brameier and Bob MacCallum, Stockholm Bioinformatics Center, Stockholm University, Sweden.

NucPred

Fetching Q09165 from www.uniprot.org...

The NucPred score for your sequence is 0.33 (see score help below)

   1  MGGRNWLFRSAVLVSTLLTCISIAQELLPSIEVESLAQDLQIQEWMRTLR    50
51 RVKRAPTRNNRPEPVVVGRNGTGKCVISADRASHFCGMEEEVSAPPIPPP 100
101 DEGKCIISKASGREICYPSYSQLDTSCTDVTGQSSNGLVVPPVVPHATVR 150
151 AMAFVPPDNLRRLIIQYYRQQGKHQPKNATFSPKSFLFVKYHCDYGYEMV 200
201 DEVDTMFCQDKKWVMTPPMCRGQGLCAADNGGCSHTCISYNDEKIECKCP 250
251 RGMTLDVDEKTCIKPIPKSLCRSLSGCTCNGITETQFACSCGDNKQKCLL 300
301 IAGPPRIYIEPQGPYEVAPGGNINISCTSVAYPFPDIYWFKNQKVNTDGP 350
351 DQNTLRASQILIIKEIYRNEEFTCVSDNIHGSANRTVSIVVTGPGSAPHL 400
401 KSASAGRTSLTVRWEPPSIINRPITTYTLYYTNNPQQPVKNWKKLEVKEP 450
451 TREVAIPDLRPDTAYYIRVRANDPLGPGKLGNQVQIKTLKPAVRPYVNIV 500
501 EGDEIRVPPMTAFEIDCNVTRADPVPVLVWLHKGRPLNKGSKTQHIKMKN 550
551 GGVLESTQFSCVAENEAGKSTKKINVTVTGPSAPERIRYQIDGDKVTLQW 600
601 EPPQITNGPMAGYDVFYTEDPSLPRDQWKVHHIDDPNARTTTVLRLNEKT 650
651 PYTFVIVGRNRLGPGLPSAPFTATTWLAAKPPVVQLEPSEEMTKEPSNDE 700
701 MIIECGAQGVPKPKIIWLWSGTLIEDGKEEFRVYDTTPTDAQDRTRSKLI 750
751 AQSTTRSGVATCQAVNSEGSDEKKVPVKILGPGSAPLGITPTPMHTGFDV 800
801 AWKPPKVTNGRITDYVVYYSKDPDAPLSDWESKTVPADTRNLTVNVDDED 850
851 TPYVVKVQARTDDGPGIISEAYEVTTGRKQVPLSVRLEIADPSVDPSTGE 900
901 TIVEPTQPIHFRCVADGRPMPSVSYSWLPINASESGDEPVPIPIHSDDSQ 950
951 PHHYNSIQVYSTTATKRILLCQARNPDGTVDDRHVFIVNKPGSAPQNPEV 1000
1001 IVDPDNRVTITWQPPKYPNGEITSYNVYITGDPSLPVDQWQVFPVDDVTD 1050
1051 PKLVLQRGALQPETPYFVKIAAVNPHGEGIHTDPKHFDTVSGAPIDAPTD 1100
1101 VLPSVSIDNTVNITWSPPTQPLGPIKSYTVYFAPEYDDSDFKTWQRISVD 1150
1151 APDGADHGEVTLPKEQFNPNTPYKIRISATNDLSEGPASDPVRFETGSGE 1200
1201 IPPTITLDPSNSTYTVEPLGAATITCTATGVPQPKVHWIKANGETVDSAT 1250
1251 LQLYDLVKDTSATCVAENNAGKTQEAVSIQVTGPGTAPNEIVLLPMPNQE 1300
1301 INVEWTSPDEVNGQITNYIIHYGEISEDGSEPATWDQVTIARDDVNHKLA 1350
1351 NLEPKKTYAIRVQAVSDRGPGVISAPQVIKTLPLAPQAITNPIIQVHPNN 1400
1401 SVTIEFTPPDDPENPGKKVKDFVIQYTTDEEPDDESVWKELKFTDPDDTD 1450
1451 DTTIVSIDGENFNPDTKYNTRIIARGEIDSQPNEPTLFATGDGVIAPSQP 1500
1501 SFNVDTEDGVIRVPAGTDYTIKCVSEGYPAPDVRWVDSHGNQLSDGPLLR 1550
1551 IIDIRKTLNAKCLAENRGGLKETDLTIFVAGPGTAPENIQLTANKPTTIS 1600
1601 VQYEVPSIPNGNISKYIIYYTPLDDQDPDHQLGQVQTKPISDWQNVHDMN 1650
1651 DGVEGPRKVDIKDFVSTDTAYAVVVQAINDDGPGPYSNQYTIRTMSRARE 1700
1701 GPPVELRVEPDGQRSAVAQWKEPVTSDVPPIGYEIYYVRGDKSVEEDDSA 1750
1751 GLNDWIKISIDDPTKLTHKIQNLLLPDTDYVFKMRAIYPDGPSVFSEPCI 1800
1801 MKTLPDGNAPYIQISTGDNGVEGSTTIQILPGSQMTIACNATGIPLPQVK 1850
1851 WIKAGNYEIDPSRVDADGNHAQFSLQVANITEDTTFNCVAQNPLGHANWT 1900
1901 INVNLIEGLEPNWRDDFVTSKSDGGQIVLVFNDELPEYLKPPNEWTIQYT 1950
1951 DDAEQPKDQWESIPSGGAPLTRVEVPNMNPGTFYYLVVDNPEKGIQTPTL 2000
2001 VVMTPKPPSDIRFGKNNDDEQIVDFKPAVASEPIKEYTISVWPSTDPSNV 2050
2051 KKFTTPADVTSGVVVDGLEPDTEYNVQVAAEFYEGEELASEPVTVKTPPR 2100
2101 DVSCECDHGCAFEMNEDAGTMEPKCYCHGGFHLTSDGKSCERDEEDDATS 2150
2151 QAVLQVTPPSITTKVAPEELLTGSSGEVDSTPETLSPVVGPDGKPLVLDK 2200
2201 KGKPIDSSGKPVKFDENGDPIAPEGTKLEKNDNGEWVYPLVDRNGKPLPV 2250
2251 DENNKPIITVIDKDGRVVTETDDGTFVTSDGKQVEVDDLGRPLDEDGNPY 2300
2301 KTNENGQFVISDVDGAVEGDDEEEQPQVIPLYVVDVDDDGKYLDEDGNEI 2350
2351 PVNEDGDPIDVNGKPLEKNEDGKFVKPKESTQETPQPTKITIVSPDGTPL 2400
2401 PTDASGSPIGLDGQPVPTDASGKPLAKDGSPLPTDNNGNYVILPSSKNSV 2450
2451 DSQPTDDAGRVIYPVVLPDGSPLATDSTGNFVNRHGDIVERDDEGKPMGP 2500
2501 DGQLLPTDASGNYIYPVTGPNDEVLPTDANGNPIYPVVGPDGTPLPTDAS 2550
2551 GAIVGPDGQPIPTDSNGKPLSKEGYPLPVDNQGNYILLPTEIDAAQSLPT 2600
2601 DDAGMPVYPIVKPDGTPLATDSTGSFINDNGEIIEKDDEGRPFGPDGLIL 2650
2651 PTDASGNYIYPAMGLDGQPLATDASGNYVLVSTEQTVTKSYPVDDSDITI 2700
2701 HPIVNPDGTPLATDSTGSYVTEDGQIIEKDDEGRPLGPDGQVLPTDDSGN 2750
2751 YIYPVAESGEETKPTDASGKTVYPVRGPDGTPLPTDASGAVIGPDGEVIP 2800
2801 TDENGIPLSQDGSPLPTDNQGNYILVLTSETPTKTLPIDESGNVVYPITK 2850
2851 PDGTPLATDSTGSFVTEDGTIIAKDDEGKPLGPDGEVLPTDASGNYVYPV 2900
2901 TVSDEQTLPTDDTGKTVYPIRGPDGTPLPTDASGPVIGPDGEIMPTDENG 2950
2951 IPLSKDGTPLPTDNDGNYVIVPSDEETSKELPIDDSGNVIYPITKPDGTP 3000
3001 LATDSTGSFVTEDGTIIEKDDEGKPLGPDGQILSTDASGNYVYPDPGLDS 3050
3051 QILTTDVYGKPIYTVIGPDGTALPTDASGAAIGPDGTPISTDETGEPLDK 3100
3101 DGSILPTDDYGNFVFVVSQELPTDAEVQTPITKPDGTLLATDSSGNYVND 3150
3151 NGDIIEKDDEGKPLGPDGEVLPTDGTGNFIYPATTSDGEVIPTDDSGKPL 3200
3201 YTIRGPDGTPLPTDETGSALGPDGEPISTDSSGKPLSKDGSPLPTDNNGN 3250
3251 YVLVPTDESTTKALPTDESGNVVIPITNPDGTSLATDSTGSFVTDDGQII 3300
3301 EKDDEGKPLGPDGAILPTDASGNYIYPVVGPDGQALPTDETGKTVYPVRG 3350
3351 PDGTPLPTDASGAVMGPDGEPIPTDANGKPLSKDGSPLPTDASGNYVLVP 3400
3401 SDEVTAKELPTDESGTIVYPVTRADGTPLATDSTGSFVTDDGQIIGKDDE 3450
3451 GKPLGPDGQVLPTDDSGNYIYPAVGPDGQAFPTDKSGKPLYPVRGPDGTS 3500
3501 LPTDASGAAIGPDGEVIPTDENGIPLDKDGSPLPTDASGNYIIVPSGELT 3550
3551 MASHPTDDTGNVIYPITKPDGTLLSTDSTGSFVTEDGQIIEKDDEGKPLG 3600
3601 PNGEALPTDDLGNYIYPITDSDEQTSPTEDVGTSVHLVRGPDGTPLPTDA 3650
3651 SGSAIGPDGEVIPTDENGVPLDKDGSPLPTDNNGNYVLVPTKESVTKILP 3700
3701 TDDSEAVVHPITRQDGTPLSTDSTGNFVTDNGEIIEKDDEGRPVGPDGQV 3750
3751 LSTDVSGNFVYPVTESPNDGEKPIHPVLGPDGSPLPTDDSGAVIGPDGEV 3800
3801 IPTDASGVPLSKLGLPLPTDSDGNYIILSSDTDVTKELPTDDTGNVIYPI 3850
3851 TKPDGTPLGTDTSGSFVSDDGQIIEKDDDGKPLGPDGQVLPTDATGNFIY 3900
3901 PVLGPDGQALPTDESGKTVYPVRRPDGNPLPTDASGAVIGPGGEPIPTDS 3950
3951 SGKPLSADGSPLPTDASGNYVLVPSDEVTAKELSTDESGTIVYPVTRADG 4000
4001 TPLATDSTGSFVTDEGQTIEKDDEGKPLGPDGQVLPTYASGNYIYPVIGP 4050
4051 DGQALPTDESGKTVYPVRGPDGTPLPTDVSGAVIGPDGEVIPTDSNGIPL 4100
4101 SQDGTPLPTDNQGNYILVPTSETATKALPTDESGNVIYPITKADGTPLAT 4150
4151 DSTGTFVTDDGQIIEKDDEGKPLGPDGQVLPTDDSGNYIYPVVGPDGQTD 4200
4201 ESGKTVYPVRGSHPTDDTGNVIYPITKPDGTLLATDSTGSFVTEDGQIIE 4250
4251 KDDEGKPLGPDGQVLPTDESGNYVYPEVKSDEQLLPTDHTGKTVYPVHGP 4300
4301 DGTPLPTDDSGAIIGPDGEVIPTDENGIPLSKDGSQLPTDNNGNYVLVPS 4350
4351 DEGATKTHPTDETSDAVHPITKPDGTPLATDSTGNFVTENGDVITKDEEG 4400
4401 KPLGPNGQILPTDASGNYIYPVIGPDGQALPTDESGKTVYPVRGPDGTPL 4450
4451 PTDASGAVISPDGEVIPTDANGIPLDKDGSPLPTDASGNYILVPSEQDIT 4500
4501 KTLPTDDSGNVIYPITKPDGTSLATDSTGSFVTEGGEIVERDEDGKPLGP 4550
4551 DGQVLPTDASGNYIYPVVGPDGQVLPTDDTGKTVYPVYGPDGIPLSTDAS 4600
4601 GAVIGPDGEPIPTDASGRPLDKDGSFLSTDASGNYILVPSDAPTNEAGPV 4650
4651 VVQHQITRPDGTPLATDSSGHFVTEDGVIIENDKEGRPIGADGQVLPTDA 4700
4701 SNNYIFTDVPTQGYAVFIPTDVVPIELEAPNCDQVDGRVDTLLFVVESSH 4750
4751 TSAPYLDTLKKLIENLLLTTPRDFLPKIGTLIYSATTEITIDIGSYGDFK 4800
4801 ELFDSTNEIREIGGIPDVTNALRTAKMILEETSRGDTLVLHLLASPMRTS 4850
4851 SKVYTERIRALPNTRLIHLNEKQWAEDPNAVELLRSHLCIPSEVPLPSMM 4900
4901 PTDASGNLLSIPTDEVVTDGTPTDESGFVIYPITKPDGTPLATDSTGSFV 4950
4951 TEDGQIIEKNEDGKPLGPDGQVLPTDNSGNYIYPIVGPDGQALPTDASGK 5000
5001 PIYPVRGPDGTPLPTDASGAVIGPDGEPIPTDASGKPLAQDGSPLPVDNE 5050
5051 GNYIILPTQQVDTKEYPTDETGNVIVPITKPDGTLLPTDSTGSFVTENGD 5100
5101 RIEFNEEGKPLGPDGEVLATDASGNYVYPGSVVEPTAEPQEVTHGPDGQV 5150
5151 LPTDASGKPIYPVRGPDGIPLPTDASGAAIGPDGETIATDENGIPLSKDG 5200
5201 SPLPTDNTGNYVLVPSDEGATEEKPTQGSESIVHPITKPDGTPLATDSTG 5250
5251 SFVTDDDQVIAKDEDGKPIGPDGQVLPTDSSGNYIYPVIGPDGQALPTDE 5300
5301 SGKTVYPVRGPDGTPLSTDASGAVIGSDGKPIPTDETGLPLNKDGSPLPT 5350
5351 DNDGNYILIPADESVVKALPTDEAKEVYPIVQPDGTPLATDSSGNFVTSS 5400
5401 GDIIDIDDEGKPLGPDGQALPTDDSGNYIYPVIGPDGQALPTDESGKTVY 5450
5451 PIRGPDGTPLPTDASGAVIGPDGEPIPTDASGKPLSQDGSPLPTDASGNY 5500
5501 ILVPSDGEVTKTLPTDDVGNVIYPITKPDGTPLATDSTGSFVTDDGQIIE 5550
5551 KDDEGKPLGPDGQVLSTDDSGNYIYPAVGPNGQTIPTDDTGRTVYPVRGP 5600
5601 DGTPLPTDASGAVIGPDGEPIPTDASGKPLSADGSPLPTDNNGNYVIVPT 5650
5651 DGSTVKSHPTDDSGNTIYPVVNEDGTPLSTDLSGNFLTNSGEIVDRDDEG 5700
5701 KPLGPDGQTLPTDASGNYVYLQKVEETTKPLPTDESGNIVYPITKPDGTP 5750
5751 LATDSTGSFVTEDGTVIEKDDEGKPVGPDGQVLPTDESGNYIYPDVTPDG 5800
5801 QVQPTDVSGKPVYPVRGPDGSTLPTDASGAALGPDGKPIPTDSNGVPLSE 5850
5851 DGSPLPTDNQGNYVLVPTSETVTKSMPTDDNRNVIYPITMSDGSLLSTDS 5900
5901 TGSFVTEDGKVIEKDDEGKPLGPDGQVLPTDASGNYIYPVHGQDGTPLPT 5950
5951 DASGAVIGPDGSPLPTDDSGAVIGPDGEVIPTDSNGIPLNKDGLPLSTDA 6000
6001 SGNYIVVSAEQPGEEIKEIPITKPDGTLLSTDSTGNFITENGEIIERDDE 6050
6051 GKPIGPDGQILPTDASGNYVYPVIGPDGQGLPTDESGKTIYPVRGPDGTP 6100
6101 LPTDASGAVIGPDGEPIPTDASGKPLSQDGSLLPTDNNGNYVLLPSNEET 6150
6151 TQGLTTDESVNVIYPITKPDGTPLATDSTGNFVTDNGETIEKDEEGKPIG 6200
6201 PDGQTLPTDDSGNYIYPVVGPDGQALPTDESGKTIYPVHGPDGTPLPTDA 6250
6251 SGASIGPDGEPIPTDTSGKPLFKDGSPLPTDSNGNFIIVPSEKRMDEELP 6300
6301 TDDSGKIIYPITKPDGTPLASDSTGVFVTEDGTIIEKDDDGKPLGPDGQV 6350
6351 LPTDASGNYIYPIVGPDGKTQPTDESEKTPYPVHGPDGTPLPTDASGAVI 6400
6401 GPDGEPIPTDASGKPLSADGSPLPTDNNGKYVLVPADEVTTKVLPTDDSG 6450
6451 NVVHPITRPDGTPLGTDASGSFITDDGQAIEKDDEGKPIGPDGQILPIDA 6500
6501 SGNYIYPVIGPDGQALPTDESGKTVYPVRGPDGTPLPTDASGAVIGLDGE 6550
6551 PIPTDASGKPLSRNGSPLSTDSSGNYIFVPTDDEKKDSKKCDISSSLSDI 6600
6601 IFVLVNDGDGAQNYDQFKKAVVGFSRKVDMSPDIIRLAVLSVGSEIAVPL 6650
6651 PLGGYQEKEHLSSILNSFEIPPIVGTEILSPVQAANQQFTSFPRTGISKM 6700
6701 VVIFADNEEKSTFIGGATYITVKYGTTPKDIINTLIEACEKGLVEIVPDD 6750
6751 TKHVIDETVPTISSTPVIVDQSGKPLPTDASGNYIDNNGKPIVIEGEEPT 6800
6801 GPEDQKLSKNKKGEWVYPLVDKFGKPVETDDNDKPVITVVDNDGNELSKN 6850
6851 DDGNWIDLSGNEIDTDELGRPLDSEGNPYKFDDNGHVVIAPQIEEEEETT 6900
6901 PAIPFIIIDGEPINEDDGVYTDKDGNVIPTNSEGKPIDENGQVLPKNEDG 6950
6951 EFVKPKEADTTQSTIVSPDGSPLPTDASGAAIGPDGEPIPTDSNGRPLAK 7000
7001 DGSPLPTDNNGRYVILPSGRYSGDTETTDESGNVIYPIINPDGTPLGTDS 7050
7051 TGNYITSIGDIIERDDEGKPIGPDGQVLTTDASGNYIYPVVGPDGLILPT 7100
7101 DATGKPIYPVRGPDGTPLPTDASGAVIGPNGEPIPTDASGKPLSQDGSPL 7150
7151 PTDVNGNYIMLPSDEVTSQSLPTDESGNVIYPITKPDGTPLGTDSSGSFI 7200
7201 TEDGQIIEKDDEGKPIGPDGQILSTDASGNYIYPDVGPDVQTLPTDGDMI 7250
7251 SVPTVEATVEFTSDKTPEVIHSITKPDGTPLSTDSTGEFVTEDGQIIEKD 7300
7301 DEGKPIGPDGQVLPTDASGNYIYPVIGLDGQALPTDKSGKTVYPVRGPNG 7350
7351 TPLPTDASGAVIGLDGEPIPTDASGKPLSADGSPLPTDAVGNYILVPSDD 7400
7401 GVIRTHPTDESGNTIYPITKPDGTPLATDSTGAFVTDDGQVIEKDDEGKP 7450
7451 IGPDGQVLPTDASGNYIYPVTSSDGQVLPTDAEKPVIVDQSGKPLPTDAS 7500
7501 GNYIDNNGKPIVIEGEEPTGPEDQKLSKNKKGEWVYPLVDKFGKPVETDD 7550
7551 NDKPVITVVDNDGNELSKNDDGNWIDLSGNEIDTDELGRPLDSEGNPYKF 7600
7601 DDNGHVVIAPQIEEEEEATPAIPFIIIDGEPINEDDGVYTDKDGNVIPTN 7650
7651 SEGKPIDENGQVLPKNEYGEFVKPKEADTTQSTIVSPDGSPLPTDASGAA 7700
7701 IGPDGEPIPTDSSGRPISKDGSPLPTDASGNYILVPSGEGVTDSLPTDEA 7750
7751 GNIIYPITKPDGTLLATDSTGSFVADDGQIIEKDDEGKPIGPDGQVLPTD 7800
7801 ASGNYIYPVIGPDGQALPTDESGKTVYPVRGPDGTPLPTDASGAVIGPDG 7850
7851 EPIPTDPSGKPLSADGSPLPTDINGNYVLVPSDESAAKVLPTDESGSVVY 7900
7901 PITKPDGTPLGTDASGSFVTDDGQAIGKDDEGKPIGPDGQTLPIDDSGNY 7950
7951 IYPVVGPDGQALPTDESGKTVYPVLGPDGIPLPTDASGAVIGPDGEIIPT 8000
8001 DASGKPLSADGSPLPTDNNGNYVLVPADEVTTKVLPTDDSGNVVHPITRP 8050
8051 DGTPLGTDASGSFVTDDGQAIEKDDEGKPIGPDGQVLPTDASGNYIYPVI 8100
8101 GPDGQALPTDKSGKTVYPVRGPDGTPLSTDASGALIGLDGEPIPTDASGK 8150
8151 PLSADGSPLPTDAVGNYILVPSDDGVIRTHPTDESGNTIYPITKPDGTPL 8200
8201 ATDSTGAFVTDDGQVIEKDDEGKPIGPDGQVLPTDASGNYIYPVTSSDGQ 8250
8251 VLPTDAEKPVIVDQSGKPLPTDASGNYIDNNGKPIVIEGEEPTGPEDQKL 8300
8301 SKNEKGEWVYPLVDKFGKPVETDDNDKPVITVVDNDGNELSKNDDGNWID 8350
8351 LSGNEIDTDELGRPLDSEGNPYKFDDNGHVVIAPQIEEEEETTPAIPFII 8400
8401 IDGEPINEDDGVYTDKDGNVIPTNSEGKPIDENGQVLPKNEDGEFVKPKE 8450
8451 ADTTQSTIVSPDGSPLPTDASGAAIGPDGEPIPTDSSGRPISKDGSPLPT 8500
8501 DASGNYILVPSGEGVTDSLPTDEAGNIIYPITKPDGTLLATDSTGSFVAD 8550
8551 DGQIIEKDDEGKPIGPDGQVLPTDASGNYIYPVIGPDGQALPTDESGKTV 8600
8601 YPVRGPDGTPLPTDASGAVIGPDGEPIPTDPSGKPLSADGSKLPTDINGN 8650
8651 YVLVPADEVTTKVLPTDDSGNVVHPITRPDGTPLGTDASGSFITEDGQIV 8700
8701 EKNDDGKPIGPDGQVLPTDSSDNYIYPSIGSDEQAMPTDTTGSVIYPLVS 8750
8751 PDGTVIEGPPKVAKPVGPDGKVLPTDASGHFIGPDGPIPTDYGVTYSDTV 8800
8801 TTPDGIPLSNDSTGAFITEDGTVIENNEDGKPIGPDGQVLPTDAYGNYIY 8850
8851 PAIGPDGQALPTDESGNPVYPVRGPDGTPLPTDVSGAVIGPDGEPIPTDA 8900
8901 SGKPLSADGGSPLPTDNNGNYVLVPADEVTTKVLPTDDSGNVVHPITRPD 8950
8951 GTPLGTDASGSFVRDDGQAIEKDDEGKPIGPDGQVLPTDASGNYIYPVIG 9000
9001 PDGQALPTDESGKTVYPVRGPDGTPLPTDASGAVIGLDGEPIPTDASGKP 9050
9051 LSAEGSPLPTDNNGNYVLVPADEVTTKVLPTDDSGNVVHPITRPDGTPLG 9100
9101 TDASGSFVRDDGQAIEKDDEGKPIGPDGQVLPTDASGNYIYPVIGPDGQA 9150
9151 LPTDESGKTVYPVRGPDGTPLPTDASGAVIGLDGEPIPTDASGKPLSAEG 9200
9201 SPLPTDNNGNYVLVPAHEVTTKVLPTDDSGNVVHPITRPDGTPLGTDASG 9250
9251 SFVTDDGQAIEKDDEGKPIGPDGQVLPTDASGNYIYPVTSSDGQVLPTDA 9300
9301 EKPVIVDQSGKPLPTDASGNYIDNNGKPIVIEGEEPTGPEDQKLSKNEKG 9350
9351 EWVYPLVDKFGKPVETDDNDKPVITVVDNDGNELSKNDDGNWIDLSGNEI 9400
9401 DTDELGRPLDSEGNPYKFDDNGHVVIAPQIEEEEEATPAIPFIIIDGEPI 9450
9451 NEDDGVYTDKDGNVIPTNSEGKPIDENGQVLPKNEDGEFVKPKEADTTQS 9500
9501 TIVSPDGSPLPTDASGAAIGPDGEPIPTDSNGRPLAKDGSPLPTDNNGRY 9550
9551 VILPSGRYSGDTETTDESGNVIYPIINPDGTPLGTDSTGNYITSIGDIIE 9600
9601 RDDEGKPIGPDGQVLTTDASGNYIYPVVGPDGLILPTDATGKPIYPVRGP 9650
9651 DGTPLPTDASGAVIGPNGEPIPTDASGKPLSQDGSPLPTDVNGNYIMLPS 9700
9701 DEVTSQSLPTDESGNVIYPITKPDGTPLGTDSSGSFITEDGQIIEKDDEG 9750
9751 KPIGPDGQILSTDASGNYIYPDVGPDVQTLPTDGDMISVPTVEATVEFTS 9800
9801 DKTPEVIHSITKPDGTPLSTDSTGEFVTEDGQIIEKDDEGKPIGPDGQVL 9850
9851 PTDASGNYIYPVIGLDGQALPTDKSGKTVYPVRGPNGTPLPTDASGAVIG 9900
9901 LDGEPIPTDASGKPLSADGSPLPTDAVGNYILVPSDDGVIRTHPTDESGN 9950
9951 TIYPITKPDGTPLATDSTGAFVTDDGQVIEKDDEGKPIGPDGQVLPTDAS 10000
10001 GNYIYPVTSSDGQVLPTDAEKPVIVDQSGKPLPTDASGNYIDNNGKPIVI 10050
10051 EGEEPTGPEDQKLSKNKKGEWVYPLVDKFGKPVETDDNDKPVITVVDNDG 10100
10101 NELSKNDDGNWIDLSGNEIDTDELGRPLDSEGNPYKFDDNGHVVIAPQIE 10150
10151 EEEEATPAIPFIIIDGEPINEDDGVYTDKDGNVIPTNSEGKPIDENGQVL 10200
10201 PKNEDGEFVKPKEADTTQSTIVSPDGSPLPTDASGAAIGPDGEPIPTDSS 10250
10251 GRPISKDGSPLPTDASGNYILVPSGEGVTDSLPTDEAGNIIYPITKPDGT 10300
10301 LLATDSTGSFVADDGQIIEKDDEGKPIGPDGQVLPTDASGNYIYPVIGPD 10350
10351 GQALPTDESGKTVFPVRGPDGTPLPTDASGAVIGPDGEPIPTDPSGKPLS 10400
10401 ADGSPLPTDINGNYVLVPSDESAAKVLPTDESGSVVYPITKPDGTPLGTD 10450
10451 SSGSYITEDGQLVGKDEEGKPVGPDGQVLPTDSAGHYVYPITGADRQILT 10500
10501 TDAAGKPIYSVFNEDGIQLPTDSSGYAIGHDGELVPTESTNGVPLNKDGT 10550
10551 PLPTNDSGHFVLVLPGATVNDSKPTDEVIVSITNPDGTLLGTDSTGAFVT 10600
10601 EDGPIIENDDEGKPVGPDGQVLPTDDSGNYIYPVIGPDGQALPTDESGKT 10650
10651 VYPIRGPDGTPLPTDASGASIGPDGEPIPTDASGKPLSKDGSPLPTDNDG 10700
10701 HYVLVPVDDSTIKAFPTDESGNVAYPITRPDGTPLGTDSSGSFVTDDGTI 10750
10751 IENDDEGKPIGPDGQVLPTDASGNYIYPVIGPDGQALPTDESGKTVYPVH 10800
10801 GPDGTPLPTDASGAAIGPDGEPIPTDASGKPLSQDGSALPTDNNGNFILV 10850
10851 PSDKSTTKTLPTDESGNFIYPITKPDGVLFATDSTGNYVTDEGELIEKDD 10900
10901 NGYPLGPDKRVLPTDGSGNYIYPAVGSDEKILPTDNLGKVVYPITRPDGS 10950
10951 PLATDSTGVFVTGDGTIVERNEEGKPIGPDGQVLTTDNSGNYIYPVIGPD 11000
11001 GEPLGTDASGKTVYPVRGPDGTPLATDAFGAVIGPDGEPIPTDASGKPLD 11050
11051 QSGFPLPTDNNGNYILVPSDEALGKILPTDENGNVVYSVTNPDGTPLATD 11100
11101 STGSFIASNGLIVEKDDEGKPIGPDGQVLPTDASGNYIYPVIGPDGQALP 11150
11151 TDESGKPIYPVFTEDGTQLPTDSTGFAIGPDGELVPTDSANGVPLSKDGS 11200
11201 PLPTDASGNYILPDSGVTTANPTDENGYAIYPITKPDGTLLATDSTGSYI 11250
11251 TQGGQLIEKDNTGKPIGPDGQVLPTDGSGNYVYPVVGPDGQALPTDDTGN 11300
11301 VVYPVINADGSLLATDSSGSFITENGKIVAKDDEGKPISPDGQVLPTDAS 11350
11351 GNYIYPALGPDGSILPTDSNGKSIYPVRGPDGTPLPTDEFGFAIGPDGKP 11400
11401 IPTDTSGKPLSADGSPLPTDNNGNYILVLSEGVTEHAPTDENGNVIYPVT 11450
11451 NPDGTPLGTDSSGAFITQDGTVVKKDEDGKPIGPDGQVLPTDNSGNYIYP 11500
11501 VIGPDGQVLPTDASGKTVHSVYGPDGTQLPTDASGSAIGPDGELVPTDVS 11550
11551 GRPLSQDGSPLPTDNNGNYALVVSDEATTKVLPTDEGGNVIYHITKPDGS 11600
11601 LLGTDASGDFITDHGKAVQKDDEGKPIGPDGSVLPTDTSGNYIYPITGPD 11650
11651 GNVLPTDSNGKPVYPVFNEDGTQLPTDSTGSAIDQDGELVSTDSTSGVPL 11700
11701 AKDGSPLPTNSAGNYVLVSSGKSQPTDEHGNVIYPITKPDGTLLATDSTG 11750
11751 SYLTEDGQLVEIDDSGKPLGSDGQVLPIDASGNYIYPALGPDGQALPTDD 11800
11801 AGNLVYPIVYPDGTPLATESTGNYVTENGEVVGKNTDGKPISPDGQVLPT 11850
11851 DASGNYIYPAVGPDGQVLPTDASGKLIYPVFHPDGTQLPTDASGYAVAPD 11900
11901 GSLIPTEFSGKPLGKDGSVLPTDNSGRYVLVHDDREVTQTIPTDESGNTI 11950
11951 YPITRPDGTLLSTDSTGIYLTDEGNVIDRDNEGKPLGPDGQVLPTDGYGN 12000
12001 FVYPADSDIGGAKLLPTDEYGHTLYPVIRPDGSLLSTESSGSFVTDDGTV 12050
12051 VSKDSDGKPLGPAGQVLPTDASGNYIYPSIGPDGSPLPTDINGKPAYTVI 12100
12101 GRYGDVLPTDSLGRAVNIDGSVVPTDDEGLPIDQYGVVLPTDTTRKLHTL 12150
12151 VPTRRPSSFCYVTSHIDLLLVIDSSNNIKVLDYRVMKELIKNFLTEHFNL 12200
12201 RKHQVRVGLVKYGDGAEIPVSLGDYDNEDDLVHRISESRRLKGRAQLGAG 12250
12251 LREALDELSISGVDGVPQIVLIVKNGKASDDYSSAVKSLKAERNVTVFVV 12300
12301 DAGDDESQQQNSELTEEDKTIVISQWRGADSEVLGPIADYICKIVPNVET 12350
12351 SRTWPTPRTKATTTSGTGRSCSSIDYESDVIIVLDSSENFTPDEFVSMKD 12400
12401 AVASIVDTGFDLAPDVSKIGFVIYSDKVAVPVALGHYEDKIELLEKITDA 12450
12451 EKINDGVAIALYGLNAARQQFQLHGRENATKVVILITNGKNRGNAAAAAE 12500
12501 DLRDMYGVQLFAVAVGSNPEELATIKRLVGNSNTENVIEVAQSTEIDDDA 12550
12551 AALLKAVCGNTSPKNSEMPAHLTTKRDVLAQKFTTAPMLRTTRAVAGGLC 12600
12601 NDGIRRPYHFNILVDITSRASADEFRRVLDHLINFFNDRMRDEQHMITIN 12650
12651 IITVNSDKVQNILSNLRADQLSEQLNAITQQSDDTVSPKLGAGIDALAEL 12700
12701 SKENYINGAIKLMLIVGSDGTSSDDALPAAEYANSDFQHNIIAVSVRKPA 12750
12751 TDLLSKIAGLPTRVVHLDQWSAPNELFDSWIAYITCDYATASTTRKSTTP 12800
12801 KMTTLRPYDRKASKEDATNIELIPLSPSSLSVSWTCCTNNKSNYTILYTH 12850
12851 DTSITKEKWIRKEVTCRDSFGTHLNELPSDHTYTVCVMTNERVDNSTALA 12900
12901 IDKNCDSLHIDQNTTAPEDYVKPSPSSCNCQCSEGKAVLRATCEMVIDTN 12950
12951 RPIATLPPATVDECPCKVKAHGGRCPKGYIAKDGQCYDIDECETNNGQCS 13000
13001 EGCVNTPGSYYCACPHGMMRDPLDPFNCVNTANSFDKIAALLANYLEANT 13050
13051 KNSGSEVTSEKSDGGRVNYKATIKSADDKTITFEWSHVPEVVRRAFKWLF 13100

Positively and negatively influencing subsequences are coloured according to the following scale:

(non-nuclear) negative ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| positive (nuclear)

with NucPred



If you find NucPred useful, please cite this paper:
NucPred - Predicting Nuclear Localization of Proteins. Brameier M, Krings A, Maccallum RM. Bioinformatics, 2007. PubMed id: 17332022
The authors also look forward to your comments and suggestions.

What does the NucPred score mean?

You have to decide on a NucPred score threshold. Sequences which score greater than or equal to this threshold are predicted to spend some time in the nucleus. Higher thresholds yield fewer predicted nuclear proteins, but these predictions are more accurate (you can have higher confidence in them). The table below gives more details of the performance of NucPred estimated using the sequences it was trained on (by cross-validation). Another benchmark is available in the Bioinformatics 2007 paper.

NucPred score threshold Specificity Sensitivity
see above fraction of proteins predicted to be nuclear that actually are nuclear fraction of true nuclear proteins that are predicted (coverage)
0.10 0.45 0.88
0.20 0.52 0.83
0.30 0.57 0.77
0.40 0.63 0.69
0.50 0.70 0.62
0.60 0.71 0.53
0.70 0.81 0.44
0.80 0.84 0.32
0.90 0.88 0.21
1.00 1.00 0.02

Sequences which score >= 0.8 with NucPred and which are predicted by PredictNLS to contain an NLS have been shown to be 93% correct with a coverage of 16%. (PredictNLS by itself is 87% correct with 26% coverage on the same data.)

Go back to the NucPred Home Page.