LOCUS AC069299_7 478 aa linear PLN 25-OCT-2000
DEFINITION hypothetical protein [Arabidopsis thaliana].
ACCESSION AAG26081
VERSION AAG26081.1
DBSOURCE accession AC069299.2
KEYWORDS .
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (residues 1 to 478)
AUTHORS Lin,X., Kaul,S., Town,C.D., Benito,M.-I., Creasy,T.H., Haas,B.J.,
Wu,D., Maiti,R., Ronning,C.M., Koo,H., Fujii,C.Y., Utterback,T.R.,
Barnstead,M.E., Bowman,C.L., White,O., Nierman,W.C. and Fraser,C.M.
TITLE Arabidopsis thaliana chromosome 1 BAC T1E4 genomic sequence
JOURNAL Unpublished
REFERENCE 2 (residues 1 to 478)
AUTHORS Town,C.D. and Kaul,S.
TITLE Direct Submission
JOURNAL Submitted (25-MAY-2000) The Institute for Genomic Research, 9712
Medical Center Dr, Rockville, MD 20850, USA, cdtown@tigr.org
REFERENCE 3 (residues 1 to 478)
AUTHORS Town,C.D. and Kaul,S.
TITLE Direct Submission
JOURNAL Submitted (25-OCT-2000) The Institute for Genomic Research, 9712
Medical Center Dr, Rockville, MD 20850, USA, cdtown@tigr.org
COMMENT Address all correspondence to:at@tigr.org
BAC clone T1E4 is from Arabidopsis thaliana chromosome 1
The orientation of the sequence is from SP6 to T7 end of the BAC
clone.
Genes were identified by a combination of several methods: Gene
prediction programs including Genscan+ (Chris Burge,
http://CCR-081.mit.edu/GENSCAN.html), GeneMarkHMM (Mark Borodovsky,
http://genemark.biology.gatech.edu/GeneMark/), GlimmerA (a variant
of GlimmerM, see Mihaela Pertea,
http://www.tigr.org/softlab/glimmerm_htm/glimmerm.html, and
GeneSplicer (Mihaela Pertea and Steven Salzberg, contact
mpertea@tigr.org), searches of the complete sequence against a
peptide database and the plant EST database at TIGR
(http://www.tigr.org/tdb/tgi.shtml). Annotated genes are named to
indicate the level of evidence for their annotation. Genes with
similarity to other proteins are named after the database hits.
Genes without significant peptide similarity but with EST
similarity are named as unknown proteins. Genes without protein
or EST similarity, that are predicted by more than two gene
prediction programs over most of their length are annotated as
hypothetical proteins. Genes encoding tRNAs are predicted by
tRNAscan-SE (Sean Eddy, http://genome.wustl.edu/eddy/tRNAscan-SE/).
Simple repeats are identified by repeatmasker (Arian Smit,
http://ftp.genome.washington.edu/RM/RepeatMasker.html).
Method: conceptual translation.
FEATURES Location/Qualifiers
source 1..478
/organism="Arabidopsis thaliana"
/db_xref="taxon:3702"
/chromosome="I"
/map="mi423"
/clone="T1E4"
/ecotype="Columbia"
Protein 1..478
/product="hypothetical protein"
Region 7..>453
/region_name="PLN00113"
/note="leucine-rich repeat receptor-like protein kinase;
Provisional"
/db_xref="CDD:215061"
Region 128..151
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
Region 152..175
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
Region 176..199
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
Region 200..223
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
Region 224..248
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
Region 249..272
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
Region 273..296
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
Region 297..319
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
Region 348..393
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
Region 394..438
/region_name="leucine-rich repeat"
/note="leucine-rich repeat [structural motif]"
/db_xref="CDD:275380"
CDS 1..478
/gene="T1E4.2"
/coded_by="complement(AC069299.2:16951..18387)"
ORIGIN
1 mnsccfplfi fafviflrcl spieaatchp ddeagllafk sgitqdptgi lsswkkgtdc
61 cswkgvgclt nrvtglting qsdvtgsfls gtispslakl qhlvgiyftn lrnitgsfpq
121 flfqlpnvkq vyftnsrlsg plpanigals elgelsldgn lftgpipssi snltrlylln
181 lgdnlltgti plglanlkil lslnfgnnrl setipdifks mqklqsltls rnkfsgnlpp
241 siaslkpiln yldlsqnnls gtiptflsnf kvldsldlsr nrfsgvvpks lanmpklfhl
301 nlshnfltgp lpamknvdgl atldlsynqf hlktipkwvt sspsmyslkl vkcginmsld
361 nwkpvrpniy fyidlsenei sgsltwffnl ahnlyefqas gnklrfdmgk lnlserlesl
421 dlsrnlifgk vpmtvaklqk lnlshnhlcg klpvtkfpas afvgndclcg splspcki
//