LOCUS NM_001085225 2208 bp mRNA linear PLN 20-OCT-2022
DEFINITION Arabidopsis thaliana transcription factor, putative (Protein of
unknown function, DUF547) (AT5G42690), mRNA.
ACCESSION NM_001085225
VERSION NM_001085225.2
DBLINK BioProject: PRJNA116
BioSample: SAMN03081427
KEYWORDS RefSeq.
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 2208)
AUTHORS Tabata,S., Kaneko,T., Nakamura,Y., Kotani,H., Kato,T., Asamizu,E.,
Miyajima,N., Sasamoto,S., Kimura,T., Hosouchi,T., Kawashima,K.,
Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
Nakazaki,N., Naruo,K., Okumura,S., Shinpo,S., Takeuchi,C., Wada,T.,
Watanabe,A., Yamada,M., Yasuda,M., Sato,S., de la Bastide,M.,
Huang,E., Spiegel,L., Gnoj,L., O'Shaughnessy,A., Preston,R.,
Habermann,K., Murray,J., Johnson,D., Rohlfing,T., Nelson,J.,
Stoneking,T., Pepin,K., Spieth,J., Sekhon,M., Armstrong,J.,
Becker,M., Belter,E., Cordum,H., Cordes,M., Courtney,L.,
Courtney,W., Dante,M., Du,H., Edwards,J., Fryman,J., Haakensen,B.,
Lamar,E., Latreille,P., Leonard,S., Meyer,R., Mulvaney,E.,
Ozersky,P., Riley,A., Strowmatt,C., Wagner-McPherson,C., Wollam,A.,
Yoakum,M., Bell,M., Dedhia,N., Parnell,L., Shah,R., Rodriguez,M.,
See,L.H., Vil,D., Baker,J., Kirchoff,K., Toth,K., King,L.,
Bahret,A., Miller,B., Marra,M., Martienssen,R., McCombie,W.R.,
Wilson,R.K., Murphy,G., Bancroft,I., Volckaert,G., Wambutt,R.,
Dusterhoft,A., Stiekema,W., Pohl,T., Entian,K.D., Terryn,N.,
Hartley,N., Bent,E., Johnson,S., Langham,S.A., McCullagh,B.,
Robben,J., Grymonprez,B., Zimmermann,W., Ramsperger,U., Wedler,H.,
Balke,K., Wedler,E., Peters,S., van Staveren,M., Dirkse,W.,
Mooijman,P., Lankhorst,R.K., Weitzenegger,T., Bothe,G., Rose,M.,
Hauf,J., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S.,
Villarroel,R., Gielen,J., Ardiles,W., Bents,O., Lemcke,K.,
Kolesov,G., Mayer,K., Rudd,S., Schoof,H., Schueller,C.,
Zaccaria,P., Mewes,H.W., Bevan,M. and Fransz,P.
CONSRTM Kazusa DNA Research Institute; Cold Spring Harbor and Washington
University in St Louis Sequencing Consortium; European Union
Arabidopsis Genome Sequencing Consortium
TITLE Sequence and analysis of chromosome 5 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 823-826 (2000)
PUBMED 11130714
REFERENCE 2 (bases 1 to 2208)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (19-OCT-2022) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 2208)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
REFERENCE 4 (bases 1 to 2208)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
COMMENT REVIEWED REFSEQ: This record has been curated by TAIR and Araport.
This record is derived from an annotated genomic sequence
(NC_003076).
On Sep 12, 2016 this sequence version replaced NM_001085225.1.
FEATURES Location/Qualifiers
source 1..2208
/organism="Arabidopsis thaliana"
/mol_type="mRNA"
/db_xref="taxon:3702"
/chromosome="5"
/ecotype="Columbia"
gene 1..2208
/locus_tag="AT5G42690"
/gene_synonym="MJB21.6; MJB21_6"
/db_xref="Araport:AT5G42690"
/db_xref="GeneID:834278"
/db_xref="TAIR:AT5G42690"
CDS 332..1954
/locus_tag="AT5G42690"
/gene_synonym="MJB21.6; MJB21_6"
/inference="Similar to RNA sequence,
EST:INSD:EG476451.1,INSD:EG476450.1,INSD:EG476444.1,
INSD:EG476434.1,INSD:EG476430.1,INSD:EG476471.1,
INSD:EL183353.1,INSD:EG476447.1,INSD:ES093773.1,
INSD:EG476468.1,INSD:EG476454.1,INSD:EG476449.1,
INSD:EG476436.1,INSD:EG476433.1,INSD:EG476437.1,
INSD:EG476443.1,INSD:EG476435.1,INSD:EG476440.1,
INSD:EG476445.1,INSD:EG476438.1,INSD:EG476466.1"
/inference="similar to RNA sequence, mRNA:INSD:DQ132740.1"
/note="Protein of unknown function, DUF547; LOCATED IN:
chloroplast; EXPRESSED IN: 17 plant structures; EXPRESSED
DURING: 9 growth stages; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF547 (InterPro:IPR006869);
BEST Arabidopsis thaliana protein match is: Protein of
unknown function, DUF547 (TAIR:AT4G37080.3); Has 821 Blast
hits to 802 proteins in 158 species: Archae - 4; Bacteria
- 211; Metazoa - 31; Fungi - 2; Plants - 490; Viruses - 0;
Other Eukaryotes - 83 (source: NCBI BLink)."
/codon_start=1
/product="transcription factor, putative (Protein of
unknown function, DUF547)"
/protein_id="NP_001078694.1"
/db_xref="GeneID:834278"
/db_xref="TAIR:AT5G42690"
/db_xref="Araport:AT5G42690"
/translation="MMMSSSSSSSNGSSSYTRCVKTPSSTNTNIGKKLKGCENGVVNR
KALNREKIITLQEDVEKLRKKLRLEENIHRAMERAFSRPLGALPRLPPFLPPSVLELL
AEVAVLEEELVRLEEHIVHCRQELYQEAVFTSSSIENLKCSPAFPKHWQTKSKSASTS
ARESESPLSRAPCSVSVCRKGKENKLSATSIKTPMKKTTIAHTQLNKSLEAQKLKQDS
HRCRKTNAERSSHGGGDEPNKISEDLVKCLSNIFMRMSSIKRSMVTKSQENDKDTAFR
DPYGICSSFRRRDIGRYKNFSDVEEASLNQNRTSSSSLFLIRQLKRLLGRLSLVNMQK
LNQQEKLAFWINIYNSCMMNGFLEHGIPESPDMVTLMQKATINVGGHFLNAITIEHFI
LRLPHHSKYISPKGSKKNEMAVRSKFGLELSEPLVTFALSCGSWSSPAVRVYTASKVE
EELEVAKREYLEASVGISVVKIGIPKLMDWYSHDFAKDIESLLDWIFLQLPTELGKDA
LNCVEQGMSQSPSSTLVHIIPYDFTFRYLFSI"
ORIGIN
1 cgaccgtttc accgctcctt ttgatatccc tcacttattt ttattttttg ctctaatgaa
61 tacacaaaac aaatcaagga aaaaaaaaaa actcaccgaa gctgaagacg aaacatcacc
121 aaatcttcct cttctgcaac tgtctccatc tatacgtact aaattattca tttctctcta
181 catatataat attacaaaga aagtgattca acattacaca tgtacctttc tgtaatttca
241 aactttcaag aatgatcacc atggttgatg acacaagatc actgatccaa accagttttg
301 tattatacag cttttgattc ggaagacaat tatgatgatg agtagcagca gtagcagtag
361 taatggtagt agtagttata caagatgcgt taaaactcca tcatctacaa atactaacat
421 tgggaagaaa ctgaaaggtt gtgaaaatgg tgttgtaaat cgaaaggcgt taaatcgaga
481 gaagataatt actctgcaag aagatgttga aaagttgaga aagaagctga gacttgaaga
541 aaacattcac cgagcaatgg aaagagcatt tagtaggcct cttggagcac ttccccgtct
601 tcctccgttt ctacctccat cggttttgga gttacttgcg gaagtggctg ttttggaaga
661 ggaattggtt cggttagaag aacatattgt gcattgtaga caagaattgt atcaagaagc
721 agtttttaca tcttcatcga tagagaattt aaaatgctct cctgcatttc ctaagcattg
781 gcagaccaaa tccaagtcag cttccactag tgctagagaa tcagaatcac ctctttcacg
841 ggcaccttgt tctgtttcag tgtgtaggaa gggtaaagaa aacaagttga gtgctacttc
901 tatcaaaaca ccgatgaaga aaacgactat tgctcataca caactgaata agagtttaga
961 agctcaaaaa ctaaagcagg atagtcatag atgtcgcaaa acaaatgcag agcggagctc
1021 tcatggtggc ggtgatgaac caaataaaat ttccgaggat ttggttaaat gtctgtctaa
1081 cattttcatg agaatgagct caataaagag atccatggtt acaaaatctc aagaaaacga
1141 taaagatacg gcattcaggg atccttatgg aatttgctct agttttagaa gaagggatat
1201 tggtcgatat aagaatttca gtgatgttga agaagcctca cttaaccaaa acagaacgtc
1261 aagttcctct ttgtttctca tccgccaact gaaacgattg cttggaagac tttctttagt
1321 caacatgcag aaacttaatc aacaagagaa gttagctttc tggataaaca tttataacag
1381 ctgcatgatg aatggtttcc ttgaacatgg aataccggag agccctgata tggtgacatt
1441 aatgcaaaag gcgactataa atgtaggagg ccactttctc aatgcaatca cgatcgaaca
1501 cttcatcctc cgcttaccgc atcactcaaa atatatttct ccaaagggtt caaagaaaaa
1561 tgaaatggca gtgagaagta aatttggatt ggaattgtca gagccacttg taacatttgc
1621 tctctcatgt ggtagctggt cctcacccgc ggtacgggtg tacactgcga gtaaagtaga
1681 ggaggagttg gaggtggcga aaagagagta cctagaagca tcggtgggga tatcggtggt
1741 gaaaataggg ataccaaagc tgatggattg gtatagtcat gactttgcaa aggacattga
1801 atcattgctt gattggattt tccttcagtt gcctactgag ttgggtaaag atgctctcaa
1861 ctgtgttgaa caagggatgt ctcagtcccc ttcttctact cttgtccata ttatccctta
1921 tgacttcact tttagatatc ttttttccat ctgaaaatgt tttttttttc ctaagggggg
1981 ttgttttctt tcttttaatt ttttctctga ggttcaaata ggttttgttt acacttttac
2041 atcaacagaa aaatgtagat aaataaggaa aaatcgaacc tatattgtgt tcgagttagt
2101 ttggtgtttg ttttccaaca cacatcaatt tcaaagctga ttaggtatat ggcgagaaat
2161 gaaaaagtat ttagataaat gattattcct agagtttgta agtaggaa
//