FGenes output
[ EMBnet
| APBioNet
| EBI
| NCBI
| CERNET
| PKU
| CBI ]
Pattern based Human Gene structure prediction
Name: Seq_name:all
First three lines of sequence:
TTANGATTCGTTTCCATGGAGCTGCCCATGACCATTTACACCATATACATACTGTCTCTGAGCAGAGATACGACA
CTCAGGCTGGTGATAAGGGAACACAGCTGTCAGGGGGCCAGAAGCAGCGTGTCGCCATAGCCCGAGCCATCATCC
GCAACCCCAAACTGTTGCTCCTGGACGAGGCCACGTCTGCGCTCGACACTGAGAGTGAGAAGGTGAGACTTTATT
fgenes Thu Jun 3 19:17:31 CDT 1999
FGENES 1.6 Prediction of multiple genes in genomic DNA
Time: 19:16:25 Date: Thu Jun 3 1999
Seq name: >Seq_name:all
Length of sequence: 39951 GC content: 0.46 Zone: 2
Number of predicted genes: 3 In +chain: 1 In -chain: 2
Number of predicted exons: 57 In +chain: 38 In -chain: 19
Positions of predicted genes and exons:
G Str Feature Start End Weight ORF-start ORF-end
1 + 1 CDSf 2112 - 2152 4.87 2112 - 2150
1 + 2 CDSi 2629 - 2715 1.27 2630 - 2713
1 + 3 CDSi 2858 - 2891 1.46 2859 - 2891
1 + 4 CDSi 3474 - 3620 1.91 3474 - 3620
1 + 5 CDSi 4133 - 4257 2.99 4133 - 4255
1 + 6 CDSi 4377 - 4548 1.95 4378 - 4548
1 + 7 CDSi 5608 - 5718 0.58 5608 - 5718
1 + 8 CDSi 5805 - 5930 2.78 5805 - 5930
1 + 9 CDSi 6255 - 6458 2.09 6255 - 6458
1 + 10 CDSi 6543 - 6713 0.53 6543 - 6713
1 + 11 CDSi 7091 - 7252 3.20 7091 - 7252
1 + 12 CDSi 7344 - 7538 2.39 7344 - 7538
1 + 13 CDSi 7753 - 7899 1.25 7753 - 7899
1 + 14 CDSi 9203 - 9295 1.27 9203 - 9295
1 + 15 CDSi 9569 - 9652 1.77 9569 - 9652
1 + 16 CDSi 9726 - 9929 2.82 9726 - 9929
1 + 17 CDSi 10031 - 10054 1.84 10031 - 10054
1 + 18 CDSi 10281 - 10369 1.24 10281 - 10367
1 + 19 CDSi 11243 - 11399 1.54 11244 - 11399
1 + 20 CDSi 11487 - 11684 2.34 11487 - 11684
1 + 21 CDSi 12136 - 12591 3.62 12136 - 12591
1 + 22 CDSi 14431 - 14590 2.76 14431 - 14589
1 + 23 CDSi 14661 - 14709 0.76 14663 - 14707
1 + 24 CDSi 15232 - 15403 4.36 15233 - 15403
1 + 25 CDSi 15913 - 16029 1.80 15913 - 16029
1 + 26 CDSi 16171 - 16284 1.65 16171 - 16284
1 + 27 CDSi 17296 - 17406 0.90 17296 - 17406
1 + 28 CDSi 17475 - 17600 1.68 17475 - 17600
1 + 29 CDSi 17772 - 17975 4.02 17772 - 17975
1 + 30 CDSi 18060 - 18230 1.82 18060 - 18230
1 + 31 CDSi 18614 - 18775 2.16 18614 - 18775
1 + 32 CDSi 18867 - 19034 0.98 18867 - 19034
1 + 33 CDSi 19670 - 19774 3.96 19670 - 19774
1 + 34 CDSi 20075 - 20158 1.47 20075 - 20158
1 + 35 CDSi 20232 - 20435 1.78 20232 - 20435
1 + 36 CDSi 20522 - 20622 2.06 20522 - 20620
1 + 37 CDSi 20718 - 20858 1.55 20719 - 20856
1 + 38 CDSl 21566 - 21752 5.28 21567 - 21749
1 + PolA 24311 6.56
2 - PolA 23286 3.39
2 - 1 CDSl 24536 - 24861 3.98 24539 - 24859
2 - 2 CDSi 25150 - 25228 4.26 25151 - 25228
2 - 3 CDSi 25296 - 25419 1.41 25296 - 25418
2 - 4 CDSi 25494 - 25618 1.66 25496 - 25618
2 - 5 CDSi 25697 - 25768 0.82 25697 - 25768
2 - 6 CDSi 25877 - 25960 1.58 25877 - 25960
2 - 7 CDSi 26522 - 26615 1.06 26522 - 26614
2 - 8 CDSi 27399 - 27626 1.53 27401 - 27625
2 - 9 CDSi 27747 - 27793 0.94 27749 - 27793
2 - 10 CDSi 28430 - 28553 2.21 28430 - 28552
2 - 11 CDSi 30488 - 30561 2.72 30490 - 30561
2 - 12 CDSf 31336 - 31347 5.60 31336 - 31347
3 - PolA 35164 1.87
3 - 1 CDSl 35717 - 36199 3.93 35720 - 36199
3 - 2 CDSi 36276 - 36443 1.04 36276 - 36443
3 - 3 CDSi 36758 - 36892 1.88 36758 - 36892
3 - 4 CDSi 37550 - 37641 2.97 37550 - 37639
3 - 5 CDSi 38179 - 38320 3.45 38180 - 38320
3 - 6 CDSi 38926 - 39043 1.87 38926 - 39042
3 - 7 CDSf 39473 - 39582 8.77 39475 - 39582
Predicted proteins:
>FGENES 1.5 >Seq_name:all 1 Multiexon gene 2112 - 21752 1800 a Ch+
MDMKSEIVEKQNLRWLGHLDGHHRSADGHCERAGESSDVYRVWSKQHLRSRYAEILHLLL
HLGVCCAGSGVPADVSVDPNGRAAGQTNSRVVFPRHHAAGHQLLLTSFTSKEQTAYAKAG
AVAAEVLSSIRTVFAFSGQRKAIKRYHKNLEDARDMGIKKGVAANTATGFSFLMIYLSYA
LAFWYGTTLVLNKEYTIGNLLTKPNIDSFSEDGYKPEYIKGDIVFQNIHFSYPSRPEIKI
LNDMSFHVRNGQTIALVGSSGCGKSTTIQLLQRFYDPQKGSIFIDGHDIRSLNIRYLREM
IGVVSQEPVLFATTITENIRYGRLDVTQEEIERATKESNAYDFIMNLPDKFETLVGDRGT
QLSGGQKQRIAIARALVRNPKILLLDEATSALDAESETIVQAALDKVRLGRTTIVIAHRL
STIRNADIIAGFSNGEIVEQGTHSQLMEIKGVYHGLVTMQSFQKLEDLEDSDYEPWVAEK
SQLIESFSQSSLQRRRSTRGSLLAVSEGTKEEKEKFECDQDNIEEDENVPPVSFFKVMRY
NVSEWPYILVGTICAMINGAMQPVFSIIFTEIIMFWGFQGFCFSKSGEILTLNLRLKAFI
SMMRQDLSWYDNPKNTVGALTTRLAADAAHVQGAAGVRLAVMTQNFANLGTSIIISFVYG
WELTLLILAVVPILAVAGAAEVKLLTGHAAEDKKELEMAGKIATEAIENAMIFFVYAACF
RFGAWLIEAGRMDVEGVFLVVMTMLYGAMAVGEANTYAPNFAKAKISASHLTMLINRQPA
IDNLSEEEARLEKYDGNVLFEDVKFNYPSRPDVPVLQGLNLEVQKGETLALVGSSGCGKS
TTIQLLERFYDPREGRVLLDGVDVKQLNVHWLRSQIGIVSQEPVLFDCSLAENIAYGDNS
RSVSMDEIVAAAKAANIHSFIEGLPQVAAVNQGKWLIPHLIDSHGAAHDHLHHIQTVSEQ
RYDTQAGDKGTQLSGGQKQRVAIARAIIRNPKLLLLDEATSALDTESEKFRFADRWDVVL
LISGTVMAMVNGTVMPLMCIVFGEMTDSFIYADMAQHNASGWNSTTTILNSTLQEDMQSD
VYKIQEGIGDKVGLLIQAYTTFITAFIIGFTTGWKLTLVILAVSPALAISAAFFSKIPQE
PAGRKGRGSEEGHLLQHRHGLHLPDDLPVLCSGLLVFFVVLIGAFSVGQTSPNIQNFASA
RGAAYKVYSIIDNKPNIDSFSEDGFKPDFIKGDIEFKNIHFNYPSRPEVKILNNMSLSVK
SGQTIALVGSSGCGKSTTIQLLQRFYDPEEGAVFIDGHDIRSLNIRYLREMIGVVSQEPV
LFATTITENIRYGRLDVTQEEIERATKESNAYDFIMNLPDKFETLVGDRGTQLSGGQKQR
IAIARALVRNPKILLLDEATSALDAESETIVQAALDKVRLGRTTIVVAHRLSTIRNADII
AGFSNGKIVEQGTHSQLMEIKGVYHGLVTMQTFHNVEEENTAMSELSAGEKSPVEKTVSQ
SSIIRRKSTRGSSFAASEGTKEEKTEEVFADPDRDSVRRKSEFISLMFVVIGCVSFVTMF
LQDLSWYDNPQNTVGALTTRLAADAAQVQGAAGVRLATIMQNFANLGTSIIIAFVYGWEL
TLLILAVVPLIAAAGAAEIKLLAGHAAKDKKELEKAGKIATEAIENVRTVVSLSREPKFE
CLYEENLRVPYKNSQKKAHVYGLTYSFSQAMIYFAYAACFRFGAWLIEAGRMDVEGVFLV
VSAVLYGAMAVGEANTFAPNYAKAKMAASYLMMLINKKPAIDNLSEEGTSPVNTQDPAQR
>FGENES 1.5 >Seq_name:all 2 Multiexon gene 24536 - 31347 462 a Ch-
MSKQAEFEKIAEDVKKVKTRPTDQELLDLENKWLINCQNQFQNGPSSTKVVCLPCPSHHW
KLAFLSIWKQFVLLRPRRNSRPQRTLPPLSSQMFNTVLSSQLEQWWLDAAYLEGRSPSQL
TVNFAGPAPYLEHCWPPAEGTALERASICSWHMLQYWNLIRTQLSYVRQCCDGNPEGEGV
SALTTEERTRWAKHAPYDAMVLVTMCWYVDQRIQSTGGKWKELVFTVDEKVRSDIGRAKK
QYFESDLQVVCYAFTAFGKAAIKQKKLHPDTFIQLAMQLAYFKLHQRPGCCYETAMTRKF
YHGRTETMRPCTVEAVKWCTAMTDPSCEDNAKRKAMQLAFEKHNNLMAEAQEGRVAVAET
LCCRPAWWATPQFWARWRRWFPTATASSTVSERTGGPTDGSLCCRHGLTHHIQSYPLCTL
TRRIVISISAWKSCRQTDAVSLFNVFSSCLHEMLHLATTSQL
>FGENES 1.5 >Seq_name:all 3 Multiexon gene 35717 - 39582 415 a Ch-
MAEDSESAASQQSLELDDQDTCGIDGDNEEENEHLQGSPGGDLGAKRKKKKQKRKKEKPS
SGGAKSDSASDSQEFKNPTLPIQKLQDIQRAMELLSCQGPAKSIDEAAKHKYQFWDTQPV
PKLTEGVVHVTERELRGGRRQHVQIRLFAKLSQMVLAPFSEPQEACGSEVLPPEQKHDPA
TDHEALQITRRKQHQPDAQGAGGALTDFASFYTLPSTVMHHPLHRSLKAAYSFYNVHTQT
PLLDLMNDALILAKLKGFDVFNALDLMENKVFLEKLKFGIGDGNLQYYLYNWKCPSMEPD
KVRINNACISISRRRCFCSLQPWLPFRSASSFSSRVPQGCYTNTGPKVTTDRGSCHLDMQ
VAQTLKMKRIQPTRRPSGVPPPPPLSDNTTPPQFAGRGGDNSSIINILNVKSCYA
[ GCG
| w2h
| Staden
| GeneExplorer ]
[ WebGene
| GeneFinder
| Grail
| PROCRUSTES ]
Last modified: Fri, 4 June 1999,
[email protected]