LOCUS NM_103587 2694 bp mRNA linear PLN 20-OCT-2022
DEFINITION Arabidopsis thaliana beta-galactosidase 5 (BGAL5), mRNA.
ACCESSION NM_103587
VERSION NM_103587.3
DBLINK BioProject: PRJNA116
BioSample: SAMN03081427
KEYWORDS RefSeq.
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 2694)
AUTHORS Theologis,A., Ecker,J.R., Palm,C.J., Federspiel,N.A., Kaul,S.,
White,O., Alonso,J., Altafi,H., Araujo,R., Bowman,C.L.,
Brooks,S.Y., Buehler,E., Chan,A., Chao,Q., Chen,H., Cheuk,R.F.,
Chin,C.W., Chung,M.K., Conn,L., Conway,A.B., Conway,A.R.,
Creasy,T.H., Dewar,K., Dunn,P., Etgu,P., Feldblyum,T.V., Feng,J.,
Fong,B., Fujii,C.Y., Gill,J.E., Goldsmith,A.D., Haas,B.,
Hansen,N.F., Hughes,B., Huizar,L., Hunter,J.L., Jenkins,J.,
Johnson-Hopson,C., Khan,S., Khaykin,E., Kim,C.J., Koo,H.L.,
Kremenetskaia,I., Kurtz,D.B., Kwan,A., Lam,B., Langin-Hooper,S.,
Lee,A., Lee,J.M., Lenz,C.A., Li,J.H., Li,Y., Lin,X., Liu,S.X.,
Liu,Z.A., Luros,J.S., Maiti,R., Marziali,A., Militscher,J.,
Miranda,M., Nguyen,M., Nierman,W.C., Osborne,B.I., Pai,G.,
Peterson,J., Pham,P.K., Rizzo,M., Rooney,T., Rowley,D., Sakano,H.,
Salzberg,S.L., Schwartz,J.R., Shinn,P., Southwick,A.M., Sun,H.,
Tallon,L.J., Tambunga,G., Toriumi,M.J., Town,C.D., Utterback,T.,
Van Aken,S., Vaysberg,M., Vysotskaia,V.S., Walker,M., Wu,D., Yu,G.,
Fraser,C.M., Venter,J.C. and Davis,R.W.
TITLE Sequence and analysis of chromosome 1 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 816-820 (2000)
PUBMED 11130712
REFERENCE 2 (bases 1 to 2694)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (19-OCT-2022) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 2694)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (18-JUL-2017) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
REFERENCE 4 (bases 1 to 2694)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
REFERENCE 5 (bases 1 to 2694)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
COMMENT REVIEWED REFSEQ: This record has been curated by TAIR and Araport.
This record is derived from an annotated genomic sequence
(NC_003070).
On Sep 12, 2016 this sequence version replaced NM_103587.2.
FEATURES Location/Qualifiers
source 1..2694
/organism="Arabidopsis thaliana"
/mol_type="mRNA"
/db_xref="taxon:3702"
/chromosome="1"
/ecotype="Columbia"
gene 1..2694
/gene="BGAL5"
/locus_tag="AT1G45130"
/gene_synonym="AtBGAL5; beta-galactosidase 5; F27F5.20;
F27F5_20"
/db_xref="Araport:AT1G45130"
/db_xref="GeneID:841080"
/db_xref="TAIR:AT1G45130"
CDS 337..2535
/gene="BGAL5"
/locus_tag="AT1G45130"
/gene_synonym="AtBGAL5; beta-galactosidase 5; F27F5.20;
F27F5_20"
/inference="Similar to RNA sequence,
EST:INSD:BP850164.1,INSD:BP607874.1,INSD:AV565146.1,
INSD:BP647649.1,INSD:AI994624.1,INSD:AV555896.1,
INSD:AV828682.1,INSD:BP799581.1,INSD:BP655057.1,
INSD:AV558229.1,INSD:BE528511.1,INSD:EH912843.1,
INSD:DR233759.1,INSD:ES102769.1,INSD:EG505771.1,
INSD:N96096.1,INSD:EG421291.1,INSD:EH800407.1,
INSD:BP803171.1,INSD:ES132696.1,INSD:EL190584.1,
INSD:AV830502.1,INSD:BP803036.1,INSD:EL979679.1,
INSD:AI100297.1,INSD:EL067723.1,INSD:EL984818.1,
INSD:BP599945.1,INSD:AV811953.1,INSD:ES042408.1,
INSD:BP831767.1,INSD:BP609957.1,INSD:ES192045.1,
INSD:ES073329.1,INSD:ES073101.1,INSD:AI994625.1,
INSD:BP656511.1,INSD:BP656326.1,INSD:BP810790.1,
INSD:BP848311.1,INSD:ES160359.1,INSD:AV565415.1,
INSD:EL071554.1,INSD:ES127677.1,INSD:BP595745.1,
INSD:BP608433.1,INSD:EL105731.1,INSD:CF652861.1,
INSD:BP668067.1,INSD:DR354461.1,INSD:AV806862.1,
INSD:ES192578.1,INSD:AV549694.1,INSD:BP603106.1,
INSD:ES190103.1,INSD:ES156679.1,INSD:ES195340.1,
INSD:BP835778.1,INSD:BP656331.1"
/inference="similar to RNA sequence,
mRNA:INSD:AY093977.1,INSD:AY058098.1,INSD:AJ270301.1,
INSD:BX817895.1,INSD:AY069911.1"
/note="beta-galactosidase 5 (BGAL5); FUNCTIONS IN: cation
binding, beta-galactosidase activity, hydrolase activity,
hydrolyzing O-glycosyl compounds, catalytic activity;
INVOLVED IN: lactose catabolic process, using glucoside
3-dehydrogenase, carbohydrate metabolic process, lactose
catabolic process via UDP-galactose, lactose catabolic
process; LOCATED IN: endomembrane system; EXPRESSED IN: 24
plant structures; EXPRESSED DURING: 13 growth stages;
CONTAINS InterPro DOMAIN/s: Glycoside hydrolase, family
35, conserved site (InterPro:IPR019801), Glycoside
hydrolase family 2, carbohydrate-binding
(InterPro:IPR006104), Glycoside hydrolase, family 35
(InterPro:IPR001944), Glycoside hydrolase, catalytic core
(InterPro:IPR017853), Glycoside hydrolase, subgroup,
catalytic core (InterPro:IPR013781), Galactose-binding
domain-like (InterPro:IPR008979); BEST Arabidopsis
thaliana protein match is: beta-galactosidase 3
(TAIR:AT4G36360.1); Has 2206 Blast hits to 2062 proteins
in 469 species: Archae - 15; Bacteria - 946; Metazoa -
364; Fungi - 218; Plants - 593; Viruses - 0; Other
Eukaryotes - 70 (source: NCBI BLink)."
/codon_start=1
/product="beta-galactosidase 5"
/protein_id="NP_175127.1"
/db_xref="Araport:AT1G45130"
/db_xref="GeneID:841080"
/db_xref="TAIR:AT1G45130"
/translation="MGTTILVLSKILTFLLTTMLIGSSVIQCSSVTYDKKAIVINGHR
RILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLV
RFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDNGPFKSAMQGFTE
KIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVP
WVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVED
LAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYS
HLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAPAKVV
FNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQMVPSGSILYSVARYDEDIATY
GNRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLTVDSAGHAVH
VFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFETWATGI
VGSVVLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQNKQPL
TWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFAKGDCGSCNYAGTYRQ
NKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRSVN"
ORIGIN
1 attttctaca aaacccatag aaaagcaatc acttttccac cacaataaaa tcgaaaacaa
61 acattttgta tgtatatgta aaagttaatg caaacattaa tttaaaacta tatggaaagc
121 tccattaatt tatttttaat ctcccaacta attcactaca ccaattttgt atgagagtag
181 gcgacgaaac cacaagtacc tatatcagag aatcagagac caacgtcact tatttacaca
241 ctctgttttc ttttttttgg ttgcttcttc tctgatcttt ctttcttttt ttttttttga
301 aagaatcaat ttccctctca aatcagaaat ctaaaaatgg gaacaacgat cttggttctc
361 tccaagatcc taactttctt actaacaaca atgctgattg gatccagtgt gatccagtgt
421 agtagtgtaa cctacgataa gaaagctatc gtcatcaatg gccaccgtag aatcctcctc
481 tctggttcta ttcactaccc aagaagcact cctgagatgt gggaagatct tataaagaaa
541 gctaaagacg gtggcttgga tgttattgat acttatgttt tctggaatgg tcatgaacct
601 tctcctggaa cttacaattt cgaaggaaga tatgatttgg taagattcat taagacgatt
661 caggaagtgg gtctttatgt tcatctcaga attggtcctt atgtttgtgc agaatggaat
721 tttggagggt ttcctgtgtg gttgaagtat gtagatggga ttagtttcag aactgacaat
781 ggacccttca agtctgcaat gcaaggattc acagagaaga ttgttcagat gatgaaagaa
841 catagattct ttgcgtcaca aggtggacct atcattcttt ctcagattga gaatgagttt
901 gaaccagatc ttaaagggct tggaccggct ggtcactcgt atgtcaattg ggctgcgaaa
961 atggcggtcg gtttaaacac gggagtaccg tgggtgatgt gcaaggaaga tgatgcacct
1021 gacccaatta taaacacttg caacggattc tactgtgatt attttactcc aaataaaccg
1081 tataagccaa ctatgtggac agaggcatgg agtggctggt tcacagagtt tggtggaact
1141 gttcctaaac gacctgtaga ggatctagca tttggagtag ctcgtttcat tcaaaagggc
1201 ggttcgtata taaactacta catgtaccat ggaggaacaa actttggacg caccgcagga
1261 ggtccattta tcaccactag ttatgactat gatgctccta tcgatgaata cgggttggtc
1321 caagagccca agtacagtca tcttaagcag cttcatcagg caatcaagca atgtgaagct
1381 gccttagttt cttctgatcc acatgttact aaactaggaa actacgagga ggctcatgtg
1441 ttcactgctg gcaaaggaag ttgtgtagct ttcttaacga actatcacat gaatgcacct
1501 gcaaaagtag tgttcaataa ccgacactat actctacctg cttggtccat cagcattctt
1561 ccagattgta gaaacgttgt tttcaacact gcgacggttg ctgcaaagac atcacatgtg
1621 caaatggtgc catctggttc catattgtac tcggttgcta gatacgatga agatattgct
1681 acttatggaa accgtgggac aatcacagct cgtggattgt tggagcaggt taatgttaca
1741 cgagatacaa ctgattacct gtggtacaca accagtgtgg atattaaggc atcagaatca
1801 ttcttgcgtg gaggaaaatg gccaactctt acagtggatt ctgcagggca tgctgttcat
1861 gtgtttgtca atggacattt ttacggatct gcctttggaa caagagaaaa cagaaagttt
1921 tcatttagct cccaagtcaa tctacgaggt ggagctaaca aaatcgcact actgagtgta
1981 gcagttgggt tgccgaatgt tggaccacat tttgagacgt gggccacagg aatcgtcggg
2041 tctgtggtgc ttcatggcct tgacgagggt aacaaagact tgagttggca gaaatggact
2101 tatcaggctg gtctgcgagg ggaatcaatg aacttggtct ctcctactga ggactcctct
2161 gttgattgga tcaaaggctc attggctaag caaaacaaac aacctttgac atggtacaag
2221 gcctattttg acgcgcctag agggaacgag ccgctggctt tggatctaaa gagtatgggg
2281 aaagggcaag cttggataaa cgggcaaagc atagggagat actggatggc atttgccaaa
2341 ggagactgcg gaagttgtaa ctacgctgga acatacaggc agaacaaatg ccagtctggt
2401 tgtggcgagc cgacacaaag atggtatcat gttccgcgtt cgtggttgaa gccaaaaggg
2461 aacttgttag tactttttga agaacttggt ggagatatat ccaaagtctc tgttgtgaag
2521 agatcagtaa actaacttaa aaaataaaac caatattgtg tgtgcattac actaaaagca
2581 ttactcgtgc tgtgtttaat ttatccctct ttgtgaacag aaaccatatt tgaaactcga
2641 gatagttata atacaaaatg tgtagtaaaa atgcatcatt ttattatata ccaa
//