LOCUS NM_119852 1468 bp mRNA linear PLN 20-OCT-2022
DEFINITION Arabidopsis thaliana cysteine proteinase1 (CP1), mRNA.
ACCESSION NM_119852
VERSION NM_119852.5
DBLINK BioProject: PRJNA116
BioSample: SAMN03081427
KEYWORDS RefSeq.
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 1468)
AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
Martienssen,R. and McCombie,W.R.
TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 769-777 (1999)
PUBMED 10617198
REFERENCE 2 (bases 1 to 1468)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (19-OCT-2022) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 1468)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
REFERENCE 4 (bases 1 to 1468)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
COMMENT REVIEWED REFSEQ: This record has been curated by TAIR and Araport.
This record is derived from an annotated genomic sequence
(NC_003075).
On Sep 12, 2016 this sequence version replaced NM_119852.4.
FEATURES Location/Qualifiers
source 1..1468
/organism="Arabidopsis thaliana"
/mol_type="mRNA"
/db_xref="taxon:3702"
/chromosome="4"
/ecotype="Columbia"
gene 1..1468
/gene="CP1"
/locus_tag="AT4G36880"
/gene_synonym="AP22.67; AP22_67; cysteine proteinase1;
RD21A-LIKE PROTEASE1; RDL1"
/db_xref="Araport:AT4G36880"
/db_xref="GeneID:829841"
/db_xref="TAIR:AT4G36880"
CDS 100..1230
/gene="CP1"
/locus_tag="AT4G36880"
/gene_synonym="AP22.67; AP22_67; cysteine proteinase1;
RD21A-LIKE PROTEASE1; RDL1"
/inference="Similar to RNA sequence,
EST:INSD:R84153.1,INSD:BP606289.1,INSD:AU228598.1,
INSD:AU237525.1,INSD:EH820043.1,INSD:BP811422.1,
INSD:EH914574.1"
/inference="Similar to RNA sequence,
mRNA:INSD:AK229031.1,INSD:AY043294.1"
/note="cysteine proteinase1 (CP1); FUNCTIONS IN:
cysteine-type peptidase activity, cysteine-type
endopeptidase activity; INVOLVED IN: proteolysis, response
to gibberellin stimulus, response to red light; LOCATED
IN: endomembrane system; EXPRESSED IN: 17 plant
structures; EXPRESSED DURING: 8 growth stages; CONTAINS
InterPro DOMAIN/s: Proteinase inhibitor I29, cathepsin
propeptide (InterPro:IPR013201), Peptidase C1A, papain
(InterPro:IPR013128), Peptidase C1A, papain C-terminal
(InterPro:IPR000668), Peptidase, cysteine peptidase active
site (InterPro:IPR000169); BEST Arabidopsis thaliana
protein match is: Granulin repeat cysteine protease family
protein (TAIR:AT5G43060.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0;
Other Eukaryotes - 2996 (source: NCBI BLink)."
/codon_start=1
/product="cysteine proteinase1"
/protein_id="NP_195406.2"
/db_xref="GeneID:829841"
/db_xref="TAIR:AT4G36880"
/db_xref="Araport:AT4G36880"
/translation="MAPSTKVLSLLLLYVVVSLASGDESIINDHLQLPSDGKWRTDEE
VRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTK
FTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI
KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQ
FIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPV
SVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEE
GYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVRGNTISSV"
ORIGIN
1 aataccaaat tttttgttat aaatagtccc acagttttct catcctccag aaaaacaaat
61 catcccaagt aatcctaact atacaaaaca cataacaata tggctccttc aacaaaagtt
121 ctctctttac ttctcttata tgtcgtcgtg tcattagcct ccggtgatga gtccatcatc
181 aacgaccatc tccaacttcc atcggacggc aagtggagaa ccgatgaaga agtgaggtcc
241 atctacttac aatggtccgc agaacacggg aaaactaaca acaacaacaa cggtatcatc
301 aacgaccaag acaaaagatt caatattttc aaagacaact taagattcat cgatctacac
361 aacgaaaaca acaagaacgc tacttacaag cttggtctca ccaaatttac cgatctcact
421 aacgatgagt accgcaagtt gtacctcggg gcaagaactg agcccgcccg ccgcatcgct
481 aaggccaaga atgtcaacca gaaatactca gccgctgtaa acggcaagga ggttccagag
541 acggttgatt ggagacagaa aggagccgtt aaccccatca aagaccaagg aacttgcgga
601 agttgttggg cgttttcgac tactgcagca gtagaaggta taaacaagat cgtaacagga
661 gaactcatat ctctatcaga acaagaactt gttgactgcg acaaatccta caatcaaggt
721 tgcaacggcg gtttaatgga ctacgctttt caattcatca tgaaaaatgg tggcttaaac
781 actgagaaag attatcctta ccgtggattc ggcggaaaat gcaattcttt cttgaagaat
841 tctagagttg tgagtattga tgggtacgaa gatgttccta ctaaagacga gactgcgttg
901 aagaaagcta tttcatacca accggttagt gtagctattg aagccggtgg aagaattttt
961 caacattacc aatcgggtat ttttaccgga agttgtggta caaatcttga tcacgcggtg
1021 gttgctgtcg ggtacggatc agagaacggt gttgactact ggattgtaag gaactcttgg
1081 ggtccacgtt ggggtgagga aggttacatt agaatggaga gaaacttggc agcctccaaa
1141 tccggtaagt gtgggattgc ggttgaagcc tcgtacccgg ttaagtacag cccaaacccg
1201 gttcgtggaa atactatcag cagtgtttga aatcaagacg ctcgataaac atttgggatt
1261 cttataactg aatttaatct cgtattgtta ttattgtttg tatgtatagt atttcaacaa
1321 caatttcaaa aattagttga ttcatcataa ggatataaaa tttataaatc cttatgtcga
1381 tcaatttctt ttttattcaa agaaagattg tttgcttgtt ttatgtatta agagaaatat
1441 aataaaatga tatatttctt aagagcca
//