GeneBuilder
Gene Structure Prediction System
Organism:
Human
Mouse
Mode:
Gene
Exon
Strand:
Direct
Complement
Sequencing error correction:
Disable
Error report
Automatic correction
Splice sites prediction:
All
Excellent only
Potential coding regions:
All
Good
Excellent
Key protein similarity
First and last coding exons:
Exons with key protein high similarity
Disabled
Sequence segment for coding regions prediction:
First position
Last position
Complete gene model:
Yes
No
Use repeated elements mapping:
Yes
No
Use EST mapping:
Yes
No
Protein homology search:
Yes
No
View N.
1
2
3
4
5
6
7
8
9
10
15
20
protein sequences
Use most similar protein:
Yes
No
TATA box prediction:
TATA similarity:
Good
Marginal
Poly-A site prediction:
PolyA pattern length:
6
7
8
9
10
11
12
MatInspector search:
Please select a matrix group:
Vertebrates
Fungi
Insects
Plants
Miscellaneous
All
Core similarity:
(max:1.0)
Matrix similarity:
(max:1.0)
Mail to:
No
Yes
Copy and paste or upload the Sequence file
AGCTTTCTTC TTTTCCCTGT TGCTCAAATA AATAGTGTTC TTTGCTCAAA CCCCCTTTCC CTCCTCCTTC TGCAATCTCA GCGCCTAGCG AAATCTGTTT TCTTCATTGT AACCTCAGCT TCACCGCAAT TAATTTTTTT TCCCTCTGGT CACAAGATAA TTCCTGACGC CAGTGAGTCT GGAGGTCAGA CGAACAGCAA ATTGGGGAAC AAGGCGGCAC TAATTCCTTA CAAGTTCCTT GAAAAATCTT TCGCTTAAAA AAAACGGGGG GTGGGGGGAG CTTCTTTGCT GTTCAGGGAT TTATGCCTCG CGGAGCTGTG GCTCGAACCA GTGTTGGCTA AGGCGGACTG GCAGGGGCAG GGAAGCTCAA AGATCTGGGG TGCTGCCAGG AAAAAGCAAA TTCTGGAAGT TAATGGTTTT GAGTGATTTT TAAATCCTTG CTGGCGGAGA GGCCCGCCTC TCCCCGGTAT CAGCGCTTCC TCATTCTTTG AATCCGCGGC TCCGCGGTCT TCGGCGTCAG ACCAGCCGGA GGAAGCCTGT TTGCAATTTA AGCGGGCTGT GAACGCCCAG GGCCGGCGGG GGCAGGGCCG AGGCGGGCCA TTTTGAATAA AGAGGCGTGC CTTCCAGGCA GGCTCTATAA GTGACCGCCG CGGCGAGCGT GCGCGCGTTG CAGGTCACTG TAGCGGACTT CTTTTGGTTT TCTTTCTCTT TGGGGCACCT CTGGACTCAC TCCCCAGCAT GAAGGCGCTG AGCCCGGTGC GCGGCTGCTA CGAGGCGGTG TGCTGCCTGT CGGAACGCAG TCTGGCCATC GCCCGGGGCC GAGGGAAGGG CCCGGCAGCT GAGGAGCCGC TGAGCTTGCT GGACGACATG AACCACTGCT ACTCCCGCCT GCGGGAACTG GTACCCGGAG TCCCGAGAGG CACTCAGCTT AGCCAGGTGG AAATCCTACA GCGCGTCATC GACTACATTC TCGACCTGCA GGTAGTCCTG GCCGAGCCAG CCCCTGGACC CCCTGATGGC CCCCACCTTC CCATCCAGGT AAGCCTCGAA GTCGGGACAG GGCTGAACAC CCAGGCAAGG ATGCTGCGGG ACCCTCGGAG CTCCCGATTG CCTCGCGTAA CTCTTCCCTC TTTTCCTCTA ATCAGACAGC CGAGCTCGCT CCGGAACTTG TCATCTCCAA CGACAAAAGG AGCTTTTGCC ACTGACTCGG CCGTGTCCTG ACACCTCCAG GTGAGTATCT CCTCTCTTGG AGAGGGAGGT TTAAACGGCA AGTCCTGGAG TTGGCAGACG TTTTGAAAAA TTGCCACTCA CTCGGTTTAG GGAAACTGAG GCCAGAGAGG GACAAGTGAC TTGCCCATGG TTGCATCAAA TGAATGGCAG AGTCAGTTTC CATGTGATGT GCATTTAAGC CTTAATGCGC CTGGCCCTGC CTCCGCAGTG GCCGAGGTCT GGCAAGTAGA CATGGTCCGA CTAAATACAA GTCTTTCTGT TCCATGTTGT ATAGGAGCTG TCTTCGGCAG CCCCCTCCCA GCTAGTGTCA ATTCCAAGTA GGAGGGGTAG CGCAACGTCC GCCTGTGGTC TTTGGCGCCA ACTGGGTGGG GGCAGCGTGG GGGGCGGAGT TATCAGGCTG GAGGTACAGA CCAAGTTTCC TCCCTGGCGC CGGCCAGTCT GCGGACGGCC CCCGCCTCGG CACGCTCGGC GGAAACTGAC TGCTCCTTGG TCTTCTTTCC TCCCCCGCCC AGAACGCAGG TGCTGGCGCC CGTTCTGCCT GGGACCCCGG GAACCTCTCC TGCCGGAAGC CGGACGGCAG 
GGATGGGCCC CAACTTCGCC CTGCCCACTT GACTTCACCA AATCCCTTCC TGGAGACTAA ACCTGGTGCT CAGGAGCGAA GGACTGTGAA CTTGTGGCCT GAAGAGCCAG AGCTAGCTCT GGCCACCAGC TGGGCGACGT CACCCTGCTC CCACCCCACC CCCAAGTTCT AAGGTCTTTT CAGAGCGTGG AGGTGTGGAA GGAGTGGCTG CTCTCCAAAC TATGCCAAGG CGGCGGCAGA GCTGGTCTTC TGGTCTCCTT GGAGAAAGGT TCTGTTGCCC TGATTTATGA ACTCTATAAT AGAGTATATA GGTTTTGTAC CTTTTTTACA GGAAGGTGAC TTTCTGTAAC AATGCGATGT ATATTAAACT TTTTATAAAA GTTAACATTT TGCATAATAA ACGATTTTTA AACACTTGTG TATATGATGA CACCCGTCTC CATTAAGTAC TAATGATGCT TTCTCGCACA TGGCCGAATT TTGGGAGCTT TGGGAAAGTG AACTTGCTTA TTCTACGAGA GGGAAATGAA AAACTGCCTG GTTGAGAGGG GATGGGGTGG AGAGAGAAGG GTTCATGATG GGAGTCTCAT GTCCATTGAG GGATGGGTGC AGAGAAAAGT TCTGGCTCTG CCTCATTATT TCAGAGATGA AACCAGAGAC TGGTGCAAGC T
Sequence Name:
Copy and paste or upload the Protein file
MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLDDMNHCYSRLRELVPGVPR GTQLSQVEILQRVIDYILDLQVVLAEPAPGPPDGPHLPIQTAELAPELVISNDKRSFCH