FgeneH output

HMM-based Human Gene structure prediction


Name: Seq_name:all
First three lines of sequence:
TTANGATTCGTTTCCATGGAGCTGCCCATGACCATTTACACCATATACATACTGTCTCTGAGCAGAGATACGACA
CTCAGGCTGGTGATAAGGGAACACAGCTGTCAGGGGGCCAGAAGCAGCGTGTCGCCATAGCCCGAGCCATCATCC
GCAACCCCAAACTGTTGCTCCTGGACGAGGCCACGTCTGCGCTCGACACTGAGAGTGAGAAGGTGAGACTTTATT


fgenesh  Thu Jun  3 19:20:33 CDT 1999
 FGENESH 1.0 Prediction of potential genes in genomic DNA
 Time:   Thu Jun  3 19:20:33 1999.
 Seq name: Seq_name:all 
 Length of sequence:  39951  GC content: 46 Zone: 2
 Number of predicted genes 6 in +chain 4 in -chain 2
 Number of predicted exons 78 in +chain 49 in -chain 29
 Positions of predicted genes and exons:
  G Str Feature    Start     End   Score        ORF           Len

  1 +   1 CDSi      66 -     212   17.52      66 -     212    147
  1 +   2 CDSl     285 -     491   17.21     285 -     491    207
  1 +     PolA     980              2.02  

  2 +     TSS     2279             -7.50  
  2 +   1 CDSf    2667 -    2781   13.64    2667 -    2780    114
  2 +   2 CDSi    2858 -    2891   -4.06    2860 -    2889     30
  2 +   3 CDSi    3474 -    3924   36.74    3475 -    3924    450
  2 +   4 CDSi    4133 -    4257   10.45    4133 -    4255    123
  2 +   5 CDSi    4377 -    4548   21.32    4378 -    4548    171
  2 +   6 CDSi    4634 -    4747    7.61    4634 -    4747    114
  2 +   7 CDSi    5608 -    5718    2.10    5608 -    5718    111
  2 +   8 CDSi    5805 -    5930   18.71    5805 -    5930    126
  2 +   9 CDSi    6255 -    6713   41.92    6255 -    6713    459
  2 +  10 CDSi    7091 -    7252   21.28    7091 -    7252    162
  2 +  11 CDSi    7344 -    7538   17.58    7344 -    7538    195
  2 +  12 CDSi    7753 -    7899    9.96    7753 -    7899    147
  2 +  13 CDSi    9569 -    9652    4.65    9569 -    9652     84
  2 +  14 CDSi    9726 -    9929   21.42    9726 -    9929    204
  2 +  15 CDSi   10031 -   10131    0.76   10031 -   10129     99
  2 +  16 CDSi   10229 -   10369    9.46   10230 -   10367    138
  2 +  17 CDSi   11243 -   11399    9.53   11244 -   11399    156
  2 +  18 CDSi   11487 -   11684   25.76   11487 -   11684    198
  2 +  19 CDSi   12136 -   12591   56.08   12136 -   12591    456
  2 +  20 CDSl   12664 -   12870   19.82   12664 -   12870    207
  2 +     PolA   13091              2.02  

  3 +     TSS    13221             -7.90  
  3 +   1 CDSf   13741 -   13784    4.48   13741 -   13782     42
  3 +   2 CDSi   13931 -   14027   -5.77   13932 -   14027     96
  3 +   3 CDSi   14431 -   14590   21.34   14431 -   14589    159
  3 +   4 CDSi   14661 -   14709    2.05   14663 -   14707     45
  3 +   5 CDSi   14939 -   15130   15.13   14940 -   15128    189
  3 +   6 CDSi   15232 -   15403   30.25   15233 -   15403    171
  3 +   7 CDSi   15483 -   15607    4.28   15483 -   15605    123
  3 +   8 CDSi   15913 -   16084   31.76   15914 -   16084    171
  3 +   9 CDSi   16171 -   16284   12.42   16171 -   16284    114
  3 +  10 CDSi   17296 -   17406    7.15   17296 -   17406    111
  3 +  11 CDSi   17475 -   17600   16.87   17475 -   17600    126
  3 +  12 CDSi   17772 -   18230   46.73   17772 -   18230    459
  3 +  13 CDSi   18614 -   18775   23.55   18614 -   18775    162
  3 +  14 CDSi   18867 -   19034   11.06   18867 -   19034    168
  3 +  15 CDSi   19419 -   19565   11.97   19419 -   19565    147
  3 +  16 CDSi   19670 -   19774    8.92   19670 -   19774    105
  3 +  17 CDSi   19853 -   19930    1.73   19853 -   19930     78
  3 +  18 CDSi   20075 -   20158    4.29   20075 -   20158     84
  3 +  19 CDSi   20232 -   20435   27.27   20232 -   20435    204
  3 +  20 CDSi   20522 -   20622    0.62   20522 -   20620     99
  3 +  21 CDSi   20718 -   20858   14.89   20719 -   20856    138
  3 +  22 CDSi   21566 -   21722   17.03   21567 -   21722    156
  3 +  23 CDSi   21806 -   22003   17.89   21806 -   22003    198
  3 +  24 CDSi   22822 -   22974   15.75   22822 -   22974    153
  3 +  25 CDSi   23132 -   23278   17.52   23132 -   23278    147
  3 +  26 CDSl   23351 -   23557   26.71   23351 -   23557    207
  3 +     PolA   23696             -3.38  

  4 -     PolA   23865              2.02  
  4 -   1 CDSl   24536 -   24656   10.01   24536 -   24655    120
  4 -   2 CDSi   24742 -   24861   14.27   24744 -   24860    117
  4 -   3 CDSi   24968 -   25061    5.67   24970 -   25059     90
  4 -   4 CDSi   25150 -   25228   11.45   25151 -   25228     78
  4 -   5 CDSi   25296 -   25419   10.63   25296 -   25418    123
  4 -   6 CDSi   25494 -   25624   13.38   25496 -   25624    129
  4 -   7 CDSi   25697 -   25768    7.55   25697 -   25768     72
  4 -   8 CDSi   25877 -   25960    7.70   25877 -   25960     84
  4 -   9 CDSi   26044 -   26145    3.09   26044 -   26145    102
  4 -  10 CDSi   26231 -   26356    0.45   26231 -   26356    126
  4 -  11 CDSi   26522 -   26615    2.47   26522 -   26614     93
  4 -  12 CDSi   26986 -   27094   -1.32   26988 -   27092    105
  4 -  13 CDSi   27174 -   27298    6.73   27175 -   27297    123
  4 -  14 CDSi   27399 -   27607    8.65   27401 -   27607    207
  4 -  15 CDSi   27669 -   27793    8.96   27669 -   27791    123
  4 -  16 CDSi   30444 -   30561   13.07   30445 -   30561    117
  4 -  17 CDSf   31336 -   31347    6.16   31336 -   31347     12
  4 -     TSS    31631             -9.90  

  5 +     TSS    31672             -7.20  
  5 +   1 CDSo   32379 -   33164   23.94   32379 -   33164    786
  5 +     PolA   33177              2.02  

  6 -     PolA   34304              2.02  
  6 -   1 CDSl   34929 -   35225    6.17   34929 -   35225    297
  6 -   2 CDSi   36062 -   36199    9.95   36062 -   36199    138
  6 -   3 CDSi   36276 -   36443   10.59   36276 -   36443    168
  6 -   4 CDSi   36530 -   36700   12.50   36530 -   36700    171
  6 -   5 CDSi   36784 -   36892   13.55   36784 -   36891    108
  6 -   6 CDSi   37087 -   37257   11.69   37089 -   37256    168
  6 -   7 CDSi   37349 -   37465   16.92   37351 -   37464    114
  6 -   8 CDSi   37550 -   37641   16.88   37552 -   37641     90
  6 -   9 CDSi   37748 -   37866    7.72   37748 -   37864    117
  6 -  10 CDSi   38179 -   38299   12.31   38180 -   38299    120
  6 -  11 CDSi   38926 -   39043    1.93   38926 -   39042    117
  6 -  12 CDSf   39473 -   39582   11.97   39475 -   39582    108
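
The feature table above is whitespace-delimited, so it can be read with a small parser. The following is a minimal sketch in Python, not part of FGENESH itself: the names Exon, Signal and parse_feature_line are hypothetical, and the column meanings are assumed from the table header (G, Str, Feature, Start, End, Score, ORF, Len). Rows carrying an exon number are treated as CDS features; rows with only a position and a score are treated as TSS or PolA signals.

```python
from typing import NamedTuple, Optional, Union


class Exon(NamedTuple):
    gene: int
    strand: str
    index: int       # exon number within the gene
    kind: str        # CDSf, CDSi, CDSl or CDSo, as printed in the table
    start: int
    end: int
    score: float
    orf_start: int
    orf_end: int
    length: int      # value of the Len column


class Signal(NamedTuple):
    gene: int
    strand: str
    kind: str        # TSS or PolA
    position: int
    score: float


def parse_feature_line(line: str) -> Optional[Union[Exon, Signal]]:
    """Parse one table row; return None for headers, blanks and other text."""
    tokens = line.split()
    # Exon rows: G Str N CDSx Start - End Score ORFstart - ORFend Len
    if len(tokens) == 12 and tokens[3].startswith("CDS"):
        return Exon(
            gene=int(tokens[0]), strand=tokens[1],
            index=int(tokens[2]), kind=tokens[3],
            start=int(tokens[4]), end=int(tokens[6]),
            score=float(tokens[7]),
            orf_start=int(tokens[8]), orf_end=int(tokens[10]),
            length=int(tokens[11]),
        )
    # Signal rows: G Str TSS|PolA Position Score
    if len(tokens) == 5 and tokens[2] in ("TSS", "PolA"):
        return Signal(int(tokens[0]), tokens[1], tokens[2],
                      int(tokens[3]), float(tokens[4]))
    return None


exon = parse_feature_line(
    "  2 +   1 CDSf    2667 -    2781   13.64    2667 -    2780    114"
)
# Len matches the ORF span for this row: 2780 - 2667 + 1 = 114
print(exon.length == exon.orf_end - exon.orf_start + 1)
```

In the rows checked here, the Len column equals ORF end minus ORF start plus one, which gives a simple sanity check when reading the table programmatically.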

Predicted protein(s):
>FGENESH   1   2 exon (s)     66  -    491    117 aa, chain +
RYDTQAGDKGTQLSGGQKQRVAIARAIIRNPKLLLLDEATSALDTESEKVVQEALDQARK
GRTCIVVAHRLSTIQNADCIAVFQGGVVVEKGTHQQLIAKKGVYHMLVTKQMGQHSG
>FGENESH   2  20 exon (s)   2667  -  12870   1252 aa, chain +
MAIVNGLVNPLMCIVFGEMTDSFIQEAKLSQNHNTSNPRANSTLEADMQRFSIYYSILGF
AVLVVAYLQMSLWTLTAARQAKRIRELFFHGIMQQDISWYDVTETGELNTRLTEWVTHII
HTPVPVTAGVVVIICGVRFPGAHDVYKIQEGIGDKAGLLIQAASTFITSFVIGFVHGWKL
TLVILAISPVLGLSAALYSKLLTSFTSKEQTAYAKAGAVAAEVLSSIRTVFAFSGQRKAI
KRYHKNLEDARDMGIKKGVAANTATGFSFLMIYLSYALAFWYGTTLVLNKEYTIGNLLTV
FFVVLYGAYIIGQASPNVQSFASARGAAYKVYNIIDHKPNIDSFSEDGYKPEYIKGDIVF
QNIHFSYPSRPEIKILNDMSFHVRNGQTIALVGSSGCGKSTTIQLLQRFYDPQKGSIFID
GHDIRSLNIRYLREMIGVVSQEPVLFATTITENIRYGRLDVTQEEIERATKESNAYDFIM
NLPDVRPHLWLPYLSLAPSRSNANIYIMISEQKFETLVGDRGTQLSGGQKQRIAIARALV
RNPKILLLDEATSALDAESETIVQAALDKVRLGRTTIVIAHRLSTIRNADIIAGFSNGEI
VEQGTHSQLMEIKGVYHGLVTMQSFQKLEDLEDSDYEPWVAEKSQLIESFSQSSLQRRRS
TRGSLLAVSEGTKEEKEKFECDQDNIEEDENVPPVSFFKVMRYNVSEWPYILVGTICAMI
NGAMQPVFSIIFTEIIMDLSWYDNPKNTVGALTTRLAADAAHVQGAAGVRLAVMTQNFAN
LGTSIIISFVYGWELTLLILAVVPILAVAGAAEVKLLTGHAAEDKKELEMAGKIATEAIE
NVRTVVSLTREPTFVALYEENLTVPYKNSQKKAKIYGLTYSFSQAMIFFVYAACFRFGAW
LIEAGRMDVEGVFLVVMTMLYGAMAVGEANTYAPNFAKAKISASHLTMLINRQPAIDNLS
EEEARLEKYDGNVLFEDVKFNYPSRPDVPVLQGLNLEVQKGETLALVGSSGCGKSTTIQL
LERFYDPREGRVLLDGVDVKQLNVHWLRSQIGIVSQEPVLFDCSLAENIAYGDNSRSVSM
DEIVAAAKAANIHSFIEGLPQVAAVNQGKWLIPHLIDSHGAAHDHLHHIQTVSEQRYDTQ
AGDKGTQLSGGQKQRVAIARAIIRNPKLLLLDEATSALDTESEKVVQEALDQARKGRTCI
VVAHRLSTIQNADCIAVFQGGVVVEKGTHQQLIAKKGVYHMLVTKQMGYHSG
>FGENESH   3  26 exon (s)  13741  -  23557   1290 aa, chain +
MALKIDTAETNGDLSHDSKDDGAKNEKKKKNKKEKPPQEPMVGPITLFRFADRWDVVLLI
SGTVMAMVNGTVMPLMCIVFGEMTDSFIYADMAQHNASGWNSTTTILNSTLQEDMQRFAI
YYSVLGFVVLLAAYMQVSFWTITAGRQVKRIRSLFFHCIMQQEISWFDVNDTGELNTRLT
DDVYKIQEGIGDKVGLLIQAYTTFITAFIIGFTTGWKLTLVILAVSPALAISAAFFSKVL
ASFTSKEQTAYAKAGAVAEEVLSAIRTVFAFSGQTREIERYHKNLRDAKDVGVKKAISSN
IAMGFTFLMIYLSYALAFWYGSTLILNFEYTIGNLLTVFFVVLIGAFSVGQTSPNIQNFA
SARGAAYKVYSIIDNKPNIDSFSEDGFKPDFIKGDIEFKNIHFNYPSRPEVKILNNMSLS
VKSGQTIALVGSSGCGKSTTIQLLQRFYDPEEGAVFIDGHDIRSLNIRYLREMIGVVSQE
PVLFATTITENIRYGRLDVTQEEIERATKESNAYDFIMNLPDVRPHLWLPYLSLAPSRSN
ANIYIMISEQKFETLVGDRGTQLSGGQKQRIAIARALVRNPKILLLDEATSALDAESETI
VQAALDKVRLGRTTIVVAHRLSTIRNADIIAGFSNGKIVEQGTHSQLMEIKGVYHGLVTM
QTFHNVEEENTAMSELSAGEKSPVEKTVSQSSIIRRKSTRGSSFAASEGTKEEKTEEDED
VPDVSFFKVLHLNIPEWPYILVGLICATINGAMQPVFAILFSKIITVFADPDRDSVRRKS
EFISLMFVVIGCVSFVTMFLQGYCFGKSGEILTLKLRLRAFTAMMRQDLSWYDNPQNTVG
ALTTRLAADAAQVQGAAGVRLATIMQNFANLGTSIIIAFVYGWELTLLILAVVPLIAAAG
AAEIKLLAGHAAKDKKELEKAGKIATEAIENVRTVVSLSREPKFECLYEENLRVPYKNSQ
KKAHVYGLTYSFSQAMIYFAYAACFRFGAWLIEAGRMDVEGVFLVVSAVLYGAMAVGEAN
TFAPNYAKAKMAASYLMMLINKKPAIDNLSEEGTSPEKYDGNVHFEGVKFNYPSRPDVTI
LQGLNLKVKKGETLALVGSSGCGKSTTIQLLERFYDPREGRVSLDGVNVKQLNIHWLRSQ
IGIVSQEPVLFDCSLAENIAYGDNSRSVSMDEIRYDTQAGDKGTQLSGGQKQRVAIARAI
IRNPKLLLLDEATSALDTESEKVVQEALDQARKGRTCIVVAHRLSTIQNADCIAVFQGGV
VVEKGTHQQLIAKKGVYHMLVTKQMGYHND
>FGENESH   4  17 exon (s)  24536  -  31347    614 aa, chain -
MSKQAEFEKIAEDVKKVKTRPTDQELLDLYGLYKQAIVGDVNTVRPFASEKEFKATEDIV
RNFQQGVGKELHQRLLQRAETRRNWMFNTVLSSQLEQWWLDAAYLEGRSPSQLTVNFAGP
APYLEHCWPPAEGTALERASICSWHMLQYWNLIRTERLAPQKAGETPLDMDQFRMLYCTC
KVPGVTKDAIRSYFKTELEGRCPSHLVVLCRGRIFTFDALCDGQILTPPELFRQLSYVRQ
CCDGNPEGEGVSALTTEERTRWAKAREYLISIDPHNETILELIQSSLFTICLDETQPYST
PENYTNLTRESLTGDPTIRWGDKSYNSVVYSDGTFGSNCDHAPYDAMVLVTMCWYVDQRI
QSTGGKWKELVFTVDEKVRSDIGRAKKQYFESAQDLQVVCYAFTAFGKAAIKQKKLHPDT
FIQLAMQLAYFKLHQRPGCCYETAMTRKFYHGRTETMRPCTVEAVKWCTAMTDPSCEDNA
KRKAMQLAFEKHNNLMAEAQEGRGFDRHLLGLYLIAKEEGRPVPELFLDPLYAKSGGGGN
FVLSSSLVGYTTVLGAVAPMVPHGYGFFYRIREDRIVISISAWKSCRQTDAVSLFNVFSS
CLHEMLHLATTSQL
>FGENESH   5   1 exon (s)  32379  -  33164    261 aa, chain +
MAAAAKSAKKESKRYIPTKTCFTSPFTPKWSPLPQEDMHFILNTLKENFVSIGLVKKEPK
VFRPWRKKKKQEAAQSQDSDLQVSQDAASQEPPKRGWTDVAARRKLAIGINEVTKALERN
ELKLLLVCKCVKPQHMMEHLITLSTTRDVPACQVPRLSQSVSEPLGLKSVLALGFRQCLP
QERDVFSNVVEAILPKVPPLDVPWLQDTPASIKPDENRGQKRRLETESEEGTPVSSTTLQ
PLKVKKIVPNSARKGKGKKKV
>FGENESH   6  12 exon (s)  34929  -  39582    576 aa, chain -
MAEDSESAASQQSLELDDQDTCGIDGDNEEENEHLQGSPGGDLGAKRKKKKQKRKKEKPS
SGGAKSDSASDSQEFKKLQDIQRAMELLSCQGPAKSIDEAAKHKYQFWDTQPVPKLNEVV
TSHGPIEADKENIRQEPYSLPQGFMWDTLDLGSAEVLKELYTLLNENYVEDDDNMFRFDY
SPNFLKWALRPPGWLPQWHCGVRVSSNKKLVGFISAIPADIRIYDTVKRMVEINFLCVHK
KLRSKRVAPVLIREITRRVNLEGIFQAVYTAGVVLPKPVSTCRYWHRSLNPRKLVEVKFS
HLSRNMTLQRTMKLYRLPDSTKTPGLRPMERRDIRQVTELLQKFLKRFQLAPSMTEEEVS
HWFLPQDNIIDTYVVEGAGGALTDFASFYTLPSTVMHHPLHRSLKAAYSFYNVHTQTPLL
DLMNDALILAKLKGFDVFNALDLMENKVFLEKLKFGIGDGNLQYYLYNWKCPSMEPDKLK
VLRVRELVLSSSLDPNLINYKHVGAAGRFISHHIGRNWASVQMKWQQTNSKVSAADVTKS
RRAASDGREKVTVNDSARDPSKRKTQEQDWSRGAFT
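
The predicted protein records above follow a FASTA-like layout whose header line carries the gene number, exon count, coordinates, declared length and strand. The sketch below shows one way to read them and compare the declared length with the actual sequence length. It is an illustration only: read_proteins and the dictionary keys are hypothetical names, and the header pattern is inferred from the six headers in this report rather than from any FGENESH format specification.

```python
import re
from typing import Iterable, List

# Pattern inferred from headers such as:
# >FGENESH   1   2 exon (s)     66  -    491    117 aa, chain +
HEADER = re.compile(
    r"^>FGENESH\s+(\d+)\s+(\d+)\s+exon\s*\(s\)\s+(\d+)\s+-\s+(\d+)"
    r"\s+(\d+)\s+aa,\s+chain\s+([+-])"
)


def read_proteins(lines: Iterable[str]) -> List[dict]:
    """Collect protein records; a non-sequence line ends the current record."""
    records: List[dict] = []
    current = None
    for raw in lines:
        line = raw.strip()
        match = HEADER.match(line)
        if match:
            gene, exons, start, end, aa = (int(g) for g in match.groups()[:5])
            current = {
                "gene": gene, "exons": exons, "start": start, "end": end,
                "declared_aa": aa, "strand": match.group(6), "sequence": "",
            }
            records.append(current)
        elif current is not None and line.isalpha():
            # Sequence lines contain only letters; join them into one string.
            current["sequence"] += line
        else:
            current = None
    return records


report = [
    ">FGENESH   1   2 exon (s)     66  -    491    117 aa, chain +",
    "RYDTQAGDKGTQLSGGQKQRVAIARAIIRNPKLLLLDEATSALDTESEKVVQEALDQARK",
    "GRTCIVVAHRLSTIQNADCIAVFQGGVVVEKGTHQQLIAKKGVYHMLVTKQMGQHSG",
]
for rec in read_proteins(report):
    print(rec["gene"], rec["declared_aa"], len(rec["sequence"]))
```

For gene 1 the declared length (117 aa) matches the two sequence lines (60 + 57 residues).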


Last modified: Fri, 4 June 1999