FgeneH output
[ EMBnet
| APBioNet
| EBI
| NCBI
| CERNET
| PKU
| CBI ]
HMM based Human Gene structure prediction
Name: Seq_name:all
First three lines of sequence:
TTANGATTCGTTTCCATGGAGCTGCCCATGACCATTTACACCATATACATACTGTCTCTGAGCAGAGATACGACA
CTCAGGCTGGTGATAAGGGAACACAGCTGTCAGGGGGCCAGAAGCAGCGTGTCGCCATAGCCCGAGCCATCATCC
GCAACCCCAAACTGTTGCTCCTGGACGAGGCCACGTCTGCGCTCGACACTGAGAGTGAGAAGGTGAGACTTTATT
fgenesh Thu Jun 3 19:20:33 CDT 1999
FGENESH 1.0 Prediction of potential genes in genomic DNA
Time: Thu Jun 3 19:20:33 1999.
Seq name: Seq_name:all
Length of sequence: 39951 GC content: 46 Zone: 2
Number of predicted genes 6 in +chain 4 in -chain 2
Number of predicted exons 78 in +chain 49 in -chain 29
Positions of predicted genes and exons:
G Str Feature Start End Score ORF Len
1 + 1 CDSi 66 - 212 17.52 66 - 212 147
1 + 2 CDSl 285 - 491 17.21 285 - 491 207
1 + PolA 980 2.02
2 + TSS 2279 -7.50
2 + 1 CDSf 2667 - 2781 13.64 2667 - 2780 114
2 + 2 CDSi 2858 - 2891 -4.06 2860 - 2889 30
2 + 3 CDSi 3474 - 3924 36.74 3475 - 3924 450
2 + 4 CDSi 4133 - 4257 10.45 4133 - 4255 123
2 + 5 CDSi 4377 - 4548 21.32 4378 - 4548 171
2 + 6 CDSi 4634 - 4747 7.61 4634 - 4747 114
2 + 7 CDSi 5608 - 5718 2.10 5608 - 5718 111
2 + 8 CDSi 5805 - 5930 18.71 5805 - 5930 126
2 + 9 CDSi 6255 - 6713 41.92 6255 - 6713 459
2 + 10 CDSi 7091 - 7252 21.28 7091 - 7252 162
2 + 11 CDSi 7344 - 7538 17.58 7344 - 7538 195
2 + 12 CDSi 7753 - 7899 9.96 7753 - 7899 147
2 + 13 CDSi 9569 - 9652 4.65 9569 - 9652 84
2 + 14 CDSi 9726 - 9929 21.42 9726 - 9929 204
2 + 15 CDSi 10031 - 10131 0.76 10031 - 10129 99
2 + 16 CDSi 10229 - 10369 9.46 10230 - 10367 138
2 + 17 CDSi 11243 - 11399 9.53 11244 - 11399 156
2 + 18 CDSi 11487 - 11684 25.76 11487 - 11684 198
2 + 19 CDSi 12136 - 12591 56.08 12136 - 12591 456
2 + 20 CDSl 12664 - 12870 19.82 12664 - 12870 207
2 + PolA 13091 2.02
3 + TSS 13221 -7.90
3 + 1 CDSf 13741 - 13784 4.48 13741 - 13782 42
3 + 2 CDSi 13931 - 14027 -5.77 13932 - 14027 96
3 + 3 CDSi 14431 - 14590 21.34 14431 - 14589 159
3 + 4 CDSi 14661 - 14709 2.05 14663 - 14707 45
3 + 5 CDSi 14939 - 15130 15.13 14940 - 15128 189
3 + 6 CDSi 15232 - 15403 30.25 15233 - 15403 171
3 + 7 CDSi 15483 - 15607 4.28 15483 - 15605 123
3 + 8 CDSi 15913 - 16084 31.76 15914 - 16084 171
3 + 9 CDSi 16171 - 16284 12.42 16171 - 16284 114
3 + 10 CDSi 17296 - 17406 7.15 17296 - 17406 111
3 + 11 CDSi 17475 - 17600 16.87 17475 - 17600 126
3 + 12 CDSi 17772 - 18230 46.73 17772 - 18230 459
3 + 13 CDSi 18614 - 18775 23.55 18614 - 18775 162
3 + 14 CDSi 18867 - 19034 11.06 18867 - 19034 168
3 + 15 CDSi 19419 - 19565 11.97 19419 - 19565 147
3 + 16 CDSi 19670 - 19774 8.92 19670 - 19774 105
3 + 17 CDSi 19853 - 19930 1.73 19853 - 19930 78
3 + 18 CDSi 20075 - 20158 4.29 20075 - 20158 84
3 + 19 CDSi 20232 - 20435 27.27 20232 - 20435 204
3 + 20 CDSi 20522 - 20622 0.62 20522 - 20620 99
3 + 21 CDSi 20718 - 20858 14.89 20719 - 20856 138
3 + 22 CDSi 21566 - 21722 17.03 21567 - 21722 156
3 + 23 CDSi 21806 - 22003 17.89 21806 - 22003 198
3 + 24 CDSi 22822 - 22974 15.75 22822 - 22974 153
3 + 25 CDSi 23132 - 23278 17.52 23132 - 23278 147
3 + 26 CDSl 23351 - 23557 26.71 23351 - 23557 207
3 + PolA 23696 -3.38
4 - PolA 23865 2.02
4 - 1 CDSl 24536 - 24656 10.01 24536 - 24655 120
4 - 2 CDSi 24742 - 24861 14.27 24744 - 24860 117
4 - 3 CDSi 24968 - 25061 5.67 24970 - 25059 90
4 - 4 CDSi 25150 - 25228 11.45 25151 - 25228 78
4 - 5 CDSi 25296 - 25419 10.63 25296 - 25418 123
4 - 6 CDSi 25494 - 25624 13.38 25496 - 25624 129
4 - 7 CDSi 25697 - 25768 7.55 25697 - 25768 72
4 - 8 CDSi 25877 - 25960 7.70 25877 - 25960 84
4 - 9 CDSi 26044 - 26145 3.09 26044 - 26145 102
4 - 10 CDSi 26231 - 26356 0.45 26231 - 26356 126
4 - 11 CDSi 26522 - 26615 2.47 26522 - 26614 93
4 - 12 CDSi 26986 - 27094 -1.32 26988 - 27092 105
4 - 13 CDSi 27174 - 27298 6.73 27175 - 27297 123
4 - 14 CDSi 27399 - 27607 8.65 27401 - 27607 207
4 - 15 CDSi 27669 - 27793 8.96 27669 - 27791 123
4 - 16 CDSi 30444 - 30561 13.07 30445 - 30561 117
4 - 17 CDSf 31336 - 31347 6.16 31336 - 31347 12
4 - TSS 31631 -9.90
5 + TSS 31672 -7.20
5 + 1 CDSo 32379 - 33164 23.94 32379 - 33164 786
5 + PolA 33177 2.02
6 - PolA 34304 2.02
6 - 1 CDSl 34929 - 35225 6.17 34929 - 35225 297
6 - 2 CDSi 36062 - 36199 9.95 36062 - 36199 138
6 - 3 CDSi 36276 - 36443 10.59 36276 - 36443 168
6 - 4 CDSi 36530 - 36700 12.50 36530 - 36700 171
6 - 5 CDSi 36784 - 36892 13.55 36784 - 36891 108
6 - 6 CDSi 37087 - 37257 11.69 37089 - 37256 168
6 - 7 CDSi 37349 - 37465 16.92 37351 - 37464 114
6 - 8 CDSi 37550 - 37641 16.88 37552 - 37641 90
6 - 9 CDSi 37748 - 37866 7.72 37748 - 37864 117
6 - 10 CDSi 38179 - 38299 12.31 38180 - 38299 120
6 - 11 CDSi 38926 - 39043 1.93 38926 - 39042 117
6 - 12 CDSf 39473 - 39582 11.97 39475 - 39582 108
Predicted protein(s):
>FGENESH 1 2 exon (s) 66 - 491 117 aa, chain +
RYDTQAGDKGTQLSGGQKQRVAIARAIIRNPKLLLLDEATSALDTESEKVVQEALDQARK
GRTCIVVAHRLSTIQNADCIAVFQGGVVVEKGTHQQLIAKKGVYHMLVTKQMGQHSG
>FGENESH 2 20 exon (s) 2667 - 12870 1252 aa, chain +
MAIVNGLVNPLMCIVFGEMTDSFIQEAKLSQNHNTSNPRANSTLEADMQRFSIYYSILGF
AVLVVAYLQMSLWTLTAARQAKRIRELFFHGIMQQDISWYDVTETGELNTRLTEWVTHII
HTPVPVTAGVVVIICGVRFPGAHDVYKIQEGIGDKAGLLIQAASTFITSFVIGFVHGWKL
TLVILAISPVLGLSAALYSKLLTSFTSKEQTAYAKAGAVAAEVLSSIRTVFAFSGQRKAI
KRYHKNLEDARDMGIKKGVAANTATGFSFLMIYLSYALAFWYGTTLVLNKEYTIGNLLTV
FFVVLYGAYIIGQASPNVQSFASARGAAYKVYNIIDHKPNIDSFSEDGYKPEYIKGDIVF
QNIHFSYPSRPEIKILNDMSFHVRNGQTIALVGSSGCGKSTTIQLLQRFYDPQKGSIFID
GHDIRSLNIRYLREMIGVVSQEPVLFATTITENIRYGRLDVTQEEIERATKESNAYDFIM
NLPDVRPHLWLPYLSLAPSRSNANIYIMISEQKFETLVGDRGTQLSGGQKQRIAIARALV
RNPKILLLDEATSALDAESETIVQAALDKVRLGRTTIVIAHRLSTIRNADIIAGFSNGEI
VEQGTHSQLMEIKGVYHGLVTMQSFQKLEDLEDSDYEPWVAEKSQLIESFSQSSLQRRRS
TRGSLLAVSEGTKEEKEKFECDQDNIEEDENVPPVSFFKVMRYNVSEWPYILVGTICAMI
NGAMQPVFSIIFTEIIMDLSWYDNPKNTVGALTTRLAADAAHVQGAAGVRLAVMTQNFAN
LGTSIIISFVYGWELTLLILAVVPILAVAGAAEVKLLTGHAAEDKKELEMAGKIATEAIE
NVRTVVSLTREPTFVALYEENLTVPYKNSQKKAKIYGLTYSFSQAMIFFVYAACFRFGAW
LIEAGRMDVEGVFLVVMTMLYGAMAVGEANTYAPNFAKAKISASHLTMLINRQPAIDNLS
EEEARLEKYDGNVLFEDVKFNYPSRPDVPVLQGLNLEVQKGETLALVGSSGCGKSTTIQL
LERFYDPREGRVLLDGVDVKQLNVHWLRSQIGIVSQEPVLFDCSLAENIAYGDNSRSVSM
DEIVAAAKAANIHSFIEGLPQVAAVNQGKWLIPHLIDSHGAAHDHLHHIQTVSEQRYDTQ
AGDKGTQLSGGQKQRVAIARAIIRNPKLLLLDEATSALDTESEKVVQEALDQARKGRTCI
VVAHRLSTIQNADCIAVFQGGVVVEKGTHQQLIAKKGVYHMLVTKQMGYHSG
>FGENESH 3 26 exon (s) 13741 - 23557 1290 aa, chain +
MALKIDTAETNGDLSHDSKDDGAKNEKKKKNKKEKPPQEPMVGPITLFRFADRWDVVLLI
SGTVMAMVNGTVMPLMCIVFGEMTDSFIYADMAQHNASGWNSTTTILNSTLQEDMQRFAI
YYSVLGFVVLLAAYMQVSFWTITAGRQVKRIRSLFFHCIMQQEISWFDVNDTGELNTRLT
DDVYKIQEGIGDKVGLLIQAYTTFITAFIIGFTTGWKLTLVILAVSPALAISAAFFSKVL
ASFTSKEQTAYAKAGAVAEEVLSAIRTVFAFSGQTREIERYHKNLRDAKDVGVKKAISSN
IAMGFTFLMIYLSYALAFWYGSTLILNFEYTIGNLLTVFFVVLIGAFSVGQTSPNIQNFA
SARGAAYKVYSIIDNKPNIDSFSEDGFKPDFIKGDIEFKNIHFNYPSRPEVKILNNMSLS
VKSGQTIALVGSSGCGKSTTIQLLQRFYDPEEGAVFIDGHDIRSLNIRYLREMIGVVSQE
PVLFATTITENIRYGRLDVTQEEIERATKESNAYDFIMNLPDVRPHLWLPYLSLAPSRSN
ANIYIMISEQKFETLVGDRGTQLSGGQKQRIAIARALVRNPKILLLDEATSALDAESETI
VQAALDKVRLGRTTIVVAHRLSTIRNADIIAGFSNGKIVEQGTHSQLMEIKGVYHGLVTM
QTFHNVEEENTAMSELSAGEKSPVEKTVSQSSIIRRKSTRGSSFAASEGTKEEKTEEDED
VPDVSFFKVLHLNIPEWPYILVGLICATINGAMQPVFAILFSKIITVFADPDRDSVRRKS
EFISLMFVVIGCVSFVTMFLQGYCFGKSGEILTLKLRLRAFTAMMRQDLSWYDNPQNTVG
ALTTRLAADAAQVQGAAGVRLATIMQNFANLGTSIIIAFVYGWELTLLILAVVPLIAAAG
AAEIKLLAGHAAKDKKELEKAGKIATEAIENVRTVVSLSREPKFECLYEENLRVPYKNSQ
KKAHVYGLTYSFSQAMIYFAYAACFRFGAWLIEAGRMDVEGVFLVVSAVLYGAMAVGEAN
TFAPNYAKAKMAASYLMMLINKKPAIDNLSEEGTSPEKYDGNVHFEGVKFNYPSRPDVTI
LQGLNLKVKKGETLALVGSSGCGKSTTIQLLERFYDPREGRVSLDGVNVKQLNIHWLRSQ
IGIVSQEPVLFDCSLAENIAYGDNSRSVSMDEIRYDTQAGDKGTQLSGGQKQRVAIARAI
IRNPKLLLLDEATSALDTESEKVVQEALDQARKGRTCIVVAHRLSTIQNADCIAVFQGGV
VVEKGTHQQLIAKKGVYHMLVTKQMGYHND
>FGENESH 4 17 exon (s) 24536 - 31347 614 aa, chain -
MSKQAEFEKIAEDVKKVKTRPTDQELLDLYGLYKQAIVGDVNTVRPFASEKEFKATEDIV
RNFQQGVGKELHQRLLQRAETRRNWMFNTVLSSQLEQWWLDAAYLEGRSPSQLTVNFAGP
APYLEHCWPPAEGTALERASICSWHMLQYWNLIRTERLAPQKAGETPLDMDQFRMLYCTC
KVPGVTKDAIRSYFKTELEGRCPSHLVVLCRGRIFTFDALCDGQILTPPELFRQLSYVRQ
CCDGNPEGEGVSALTTEERTRWAKAREYLISIDPHNETILELIQSSLFTICLDETQPYST
PENYTNLTRESLTGDPTIRWGDKSYNSVVYSDGTFGSNCDHAPYDAMVLVTMCWYVDQRI
QSTGGKWKELVFTVDEKVRSDIGRAKKQYFESAQDLQVVCYAFTAFGKAAIKQKKLHPDT
FIQLAMQLAYFKLHQRPGCCYETAMTRKFYHGRTETMRPCTVEAVKWCTAMTDPSCEDNA
KRKAMQLAFEKHNNLMAEAQEGRGFDRHLLGLYLIAKEEGRPVPELFLDPLYAKSGGGGN
FVLSSSLVGYTTVLGAVAPMVPHGYGFFYRIREDRIVISISAWKSCRQTDAVSLFNVFSS
CLHEMLHLATTSQL
>FGENESH 5 1 exon (s) 32379 - 33164 261 aa, chain +
MAAAAKSAKKESKRYIPTKTCFTSPFTPKWSPLPQEDMHFILNTLKENFVSIGLVKKEPK
VFRPWRKKKKQEAAQSQDSDLQVSQDAASQEPPKRGWTDVAARRKLAIGINEVTKALERN
ELKLLLVCKCVKPQHMMEHLITLSTTRDVPACQVPRLSQSVSEPLGLKSVLALGFRQCLP
QERDVFSNVVEAILPKVPPLDVPWLQDTPASIKPDENRGQKRRLETESEEGTPVSSTTLQ
PLKVKKIVPNSARKGKGKKKV
>FGENESH 6 12 exon (s) 34929 - 39582 576 aa, chain -
MAEDSESAASQQSLELDDQDTCGIDGDNEEENEHLQGSPGGDLGAKRKKKKQKRKKEKPS
SGGAKSDSASDSQEFKKLQDIQRAMELLSCQGPAKSIDEAAKHKYQFWDTQPVPKLNEVV
TSHGPIEADKENIRQEPYSLPQGFMWDTLDLGSAEVLKELYTLLNENYVEDDDNMFRFDY
SPNFLKWALRPPGWLPQWHCGVRVSSNKKLVGFISAIPADIRIYDTVKRMVEINFLCVHK
KLRSKRVAPVLIREITRRVNLEGIFQAVYTAGVVLPKPVSTCRYWHRSLNPRKLVEVKFS
HLSRNMTLQRTMKLYRLPDSTKTPGLRPMERRDIRQVTELLQKFLKRFQLAPSMTEEEVS
HWFLPQDNIIDTYVVEGAGGALTDFASFYTLPSTVMHHPLHRSLKAAYSFYNVHTQTPLL
DLMNDALILAKLKGFDVFNALDLMENKVFLEKLKFGIGDGNLQYYLYNWKCPSMEPDKLK
VLRVRELVLSSSLDPNLINYKHVGAAGRFISHHIGRNWASVQMKWQQTNSKVSAADVTKS
RRAASDGREKVTVNDSARDPSKRKTQEQDWSRGAFT
[ GCG
| w2h
| Staden
| GeneExplorer ]
[ WebGene
| GeneFinder
| Grail
| PROCRUSTES ]
Last modified: Fri, 4 June 1999,
[email protected]