Warning: The NCBI web site requires JavaScript to function. more...
An official website of the United States government
The .gov means it's official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you're on a federal government site.
The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.
Download features.
Download gene features.
NCBI Reference Sequence: NC_041774.1
FASTA Graphics
LOCUS NC_041774 7785 bp DNA linear CON 26-APR-2019 DEFINITION Macaca mulatta isolate AG07107 chromosome X, Mmul_10, whole genome shotgun sequence. ACCESSION NC_041774 REGION: 48787063..48794847 VERSION NC_041774.1 DBLINK BioProject: PRJNA528504 BioSample: SAMN09435472 Assembly: GCF_003339765.1 KEYWORDS WGS; RefSeq. SOURCE Macaca mulatta (Rhesus monkey) ORGANISM Macaca mulatta Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. COMMENT REFSEQ INFORMATION: The reference sequence is identical to CM014356.1. Assembly name: Mmul_10 The genomic sequence for this RefSeq record is from the whole-genome assembly released by The Genome Institute at Washington University School of Medicine on 2019/02/13. The original whole-genome shotgun project has the accession QNVO00000000.2. ##Genome-Assembly-Data-START## Assembly Provider :: The Genome Institute at Washington University School of Medicine Assembly Method :: HGAP4_SMRT_Link v. 5.0.1.9585 Assembly Name :: Macaca mulatta 2.0 Genome Coverage :: 66x Sequencing Technology :: PacBio RSII ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Name :: Macaca mulatta Annotation Release 103 Annotation Version :: 103 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 8.2 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7785 /organism="Macaca mulatta" /mol_type="genomic DNA" /isolate="AG07107" /bio_material="Coriell:AG07107" /db_xref="taxon:9544" /chromosome="X" /sex="female" /tissue_type="fibroblast" gene 1..7785 /gene="GATA1" /note="Derived by automated computational analysis using gene prediction method: Gnomon." /db_xref="GeneID:714631" mRNA join(1..116,4582..4820,5336..5713,5815..5960,6661..6786, 7267..7785) /gene="GATA1" /product="GATA binding protein 1" /note="Derived by automated computational analysis using gene prediction method: Gnomon. Supporting evidence includes similarity to: 3 mRNAs, 23 ESTs, 2 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 10 samples with support for all annotated introns" /transcript_id="XM_015127373.2" /db_xref="GeneID:714631" CDS join(4601..4820,5336..5713,5815..5960,6661..6786, 7267..7638) /gene="GATA1" /note="Derived by automated computational analysis using gene prediction method: Gnomon." /codon_start=1 /product="erythroid transcription factor" /protein_id="XP_014982859.2" /db_xref="GeneID:714631" /translation="MEFPGLGTLGTSEPLPQFVDPALVSSTPESGVFFPSGPEGLDAA ASSTGPSTATAAAAALAYYRDAEAYRHSPVFQVYPLLNCMEGIPGGSPYASWAYGKTG LYPASTVCPTREDSPPQATEDPDGKGSTSFLETLKTERLSPDLLTLGPALPSSLPIPN SAYGGPDFSSTFFSPTGSPLSSAAYSSPKLRGTLPLPPCEARECVNCGATATPLWRRD RTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCTNCQTTTTTLWRRNASGD PVCNACGLYYKLHQVNRPLTMRKDGIQTRNRKASGKGKKKRGSSLGGTGAAEGPAGGF MVVAGGSGSGNCGEVTSGLTLGPPGTAHLYQGLGPVVLSGPVSHLMPFPGPLLGSPTG SFPTGPMPPTTSATVVAPLSS" ORIGIN 1 gggagggagg gaggaaggga gcctcaaagg ccaaggccag ccaggacacc ccctgggatc 61 acactgagct tgccacatcc ccaaggcggc tgaaccctct gcaacaacca gcccaggtca 121 gtctcagccc ccactgagct cccaccaagg caaccctggg cctgctacct ctacctttat 181 ggctacccac ccctcagctt gctccaggct ccccaaccca cggctggtcc ccaaagtgtc 241 agccagccct gaatcccttt acagctcctc agacccatct gtctttccaa ctcccatttc 301 taccccaaat ctacccccta aaccccacct ctttctgccc tcccatccat tccgtccaag 361 atatctatat cctctggatc ctttcacttt gctccctata cccctaggtc tgtctcccct 421 aaacactaac tctgcctctc aaaacatgac tttatcccct gaaatccttt ccctgtccca 481 tatctgattt ccactcaccc acacttgtct ctgtcttcct gacttctcct ccaaacctca 541 cttttctgct cccaaacccc ttatccgctc tataacacct tgtctttgcc ccactctctc 601 tgagccccca aacctgtctc tgatctgcca aaccctagct ccatcctctt accccatctc 661 tccctctcaa acccactctt tgttcccaat gacttctctc tgatctcccc aaaccatttc 721 tctctcccaa ctccctgcct actcagtaaa atatctttgt cactccaaat accaagtcga 781 ctccccaaaa tctgttctgc cgcatgaacc ctcatgtctg ccttcccaaa accctatctc 841 agccctaaaa cacatctctt tgtctccctg aatctctcca gctccaagtc tttctctact 901 ccaattctat cttggcttct caaactcccc tttaatcact agccacattt ctgtctccaa 961 acagatcttt cactcctaaa ctccatctct cctccagaaa aacccttgaa ctccccagat 1021 ttttctacga ccctcagaat ctgactccca cactactcct caactttatt tctgactctc 1081 aaacttcgcc tctgatcccc caacctcgtc tgttttcaca acctattcct gagcccaaaa 1141 tgtgtcgctg aacttccaag cttgtctctg ctccataaac ccccatatct gccttccccc 1201 aaactccatg gctacacccc aaacctctct ctgacttaac cccatatctg accctcaaac 1261 tccacctcta tctcccccca agccccgtat ctgctcccta gaccttatat ctgctccctt 1321 cagccccgtg agaccctccc ttgagggcta agattggaat gttacctggg gtgaggggct 1381 ggggccgagg gagagtggag ggtgccggct gctgcctgct gccaccgttg gggcaacact 1441 tactctagtg gggcagctga taaggagctt tcatatcccc cagcctcgag ataaacttta 1501 tctctgtccg gagagtgata actgggggtg agggggctgg gcccctgcca tgggaggggt 1561 gggcagccct ggactcacca aggtgtgaag tgcaggggtt gggggcagtg gggctagagg 1621 gagatcatgg tgtgtaaggt ggggttgagg ggatgaggga atagtaggtg aaagaaagag 1681 agcaggagac acacaatgag acagggacgg gaaggagaaa cagagggaga gatgggaaga 1741 gagagacaga gacacagaga caaagagcaa gagacaaaag agacagaaag agaaaggaaa 1801 aacagagaga gagaaactta atgagtaaga gacagagaca gacaaacggc cggagaaaaa 1861 gaaaaaaaga cccaggagca gaaaaggaga tagaaatgca gagatacccg caggtagcac 1921 tgtgaagaga cagagactaa cacagaatta gagacagaga ggaaagagat agagaccaag 1981 atggggaagc aagacactct cttcagagag caacagcagg caccctgggg ctgtgaaggt 2041 gttggtggtg ggcacacttg tggcccatca ctttcctagc acaatcccca agcctgggcc 2101 tgccccttag cctgtctata ccatgggtca tgtaagcagg ggtgaagggc agtttgaaga 2161 cagccttcta gtctagccct gtcactccac cctacagaca cagaaacaga gcaggccagc 2221 acagtagggc cagggccagc attccaggct ccttatcccg agctgcatgt ggctgggtac 2281 ggttaggggt atgagactac tacgaggcac atggacataa ggtcagagga cccaaagaga 2341 ggggaacaga gggaagagac aaacaagcgg gaagaatggg agacatgcag agatgttcag 2401 agagatagag gaaagggaga aagcaagagt aacagagaga gccagagaga gaaacatgga 2461 aagacagaaa acaagacaga aaatcatcaa acatttataa aagcacagaa agaggccagg 2521 cgcggtggct cacacctata atctcagcac tttgggaggc catggtgggc agatcacttg 2581 aggtcaggag ttcaagatca gcctggccaa catggcaaaa ccccatctct actaaaaata 2641 caaaaattag ctgggtgtgg tggcacgtgc ctgtaatccc agctactagg gaggctgagg 2701 catgagaatc acttgaacct gggaggctga ggttgcagtg agccaagatc gcaccactgc 2761 acgccagcct gggtgacaga gtgagattct gttttaaaaa aaaaaaaaaa agcacaaaag 2821 gagaagacaa gacaagctat atcatagaaa cacagaaagc tgggggagct acagagtcag 2881 acttgagagg aaacagtcag gggaaagagc cctgggggac agagaggtgg agagctagat 2941 gacagagaca cacacaaggc aaaatagagg caaagggcca tcccacaagt ctcagctcag 3001 ctgcagccca tcaccacccc caatacagca gatggggaaa ctgaggcctg ggaactgaaa 3061 gagcctgaaa gcagaactat ggtgggcctc ccaaggggaa agggtaccgg tagaggtacc 3121 tcccacctgt caccctcttc agaggaagtc agtggccttg gggtgatttc aaaagttggg 3181 cggggaaggc agagataagc agtggggggt actgaccccc ccacaccaag aagtacggag 3241 gaactaaggg ggccttctgt ctgtagatgc aacggaggtg gagggaggag ggagtcaagc 3301 ccggaaacca tggggtttct gagaaagtta gagggcaaga tacaacagat agggatgaag 3361 ttggggagca gaggatggtg aaccccaaag tcctggggga agtgaccaag aggctcaagg 3421 gactctggtc ctgcacccca tcccaccttc acccagactc ttctagaggg ggagggaaga 3481 aagaatgggg ggatggggag aataagagga agtggaggag gggaggagag gagatgggga 3541 aagggagaaa aagatgaaca gaaaagggag agaggagaag gatgaagaaa atggagggga 3601 ggagaaaatg agtagtagga gaaaggagtg gagagtagat gtagacaggt gaagagaggc 3661 cgaacatggt gactcatacc tgtaatccca gtactttggg aggctgaggt gggaagatca 3721 cttgaggcca ggagttcaag accagcctgg gcaacatagt tataccgcat ctctacaaaa 3781 aaaaccaaaa actagccagg cgtggtgaca tacacctgta gttccagtta ctcaggaggc 3841 tgagacagga ggatcacttg cgtctgggag gcagtgggtg tagtgagccg agatcatgcc 3901 actgcactcc agcctggcaa cagaggagac cctgcctcaa aaaaaaaaag tggtgtgtgc 3961 ggatggggga gagggagata aggtgtgtga ggaagatata aaaggagaat gaagacaaaa 4021 agaggaggac aaggaagagg ggaaacagga aaggaaaagg aggaggtcaa tgaggaaaag 4081 aagaaggtag agaagaaaaa ggaagtagga gaggagaatg agaaaagagt ggaaagagaa 4141 ggaagtgggg aagaggggga caaagaagag atgggaagaa gagagatggg agaaagaggg 4201 ggaaaaggga agagggggag gaagagaatg gaaggaagaa gggagaggta ggagcagatg 4261 aaagtggagg aagagcgaga ggagggagga gaggaaagac aaacggggca agatagggag 4321 aatgaggagg aagatgaaag gaggggcaca caggagtgga agggggagat gcaggaggaa 4381 agagaggaga gggaggaaga gaagaagggg aaggagaagg gagactaagg tgaggaggag 4441 gaggggaatg ggaggtggga aggaggaata tggaaactga ggtgatggag tgggaggagg 4501 gggaaggagg gaagaggagc agatgaaagg aggtgaagga cagaggattt ctgtgtctga 4561 ggaccccttc tgtcttcgca ggttaacccc cagaggctcc atggagttcc ctggcctggg 4621 gaccctgggg acctcagagc ccctccccca gtttgtggat cctgctctgg tgtcctccac 4681 gccagaatca ggggttttct tcccctctgg gcctgagggc ttggatgcag cagcttcctc 4741 caccggccca agcacagcca ccgctgcagc tgcagcgctg gcctactaca gggacgctga 4801 ggcctacaga cactctccag gtaactccat tgggtggctg tcttggcatt ggctgagtgc 4861 tgttggggtt gccatggaga tccttggcta ggtcagagta ccactgtgag gatatctcag 4921 aaaaggctgg aagcttctca aatggatgtg ccgaccactt tccctagtta agggcagacc 4981 tgggaattcc aatgctcctc aacctgccac attggggcga ccacactgaa aggcaatatt 5041 ggaagtatgt ggtggttgcc ctagttgttg agtgatctgt ggagctccaa atcccaacag 5101 tcatcctcaa aagcccactt ggaaatggtc ataggttatt gcagaggcca cactgaccag 5161 tgggggtcag gatccaggaa gcatccaatg gccagcagct gttctggcag cctgtggaaa 5221 agctgggaac ttggccacca tgttgggggt gccgggaacc actgcaccct gacatgggct 5281 gaccctagac tgattttgcc tcttctttcc tctgtcccta cctgcccccc aacagtcttt 5341 caggtgtacc cattgctcaa ctgtatggag gggatcccag ggggctcacc atatgccagc 5401 tgggcctacg gcaagacggg gctctaccct gcctcaactg tgtgtcccac ccgcgaggac 5461 tcccctcccc aggccacgga agatccagat ggaaaaggca gcaccagctt cctggagact 5521 ttgaagacag agcggctgag cccagacctc ttgaccctgg ggcctgcact gccttcatca 5581 ctccctatcc ccaatagtgc ttatgggggc cctgactttt ccagtacctt cttttctccc 5641 accgggagcc ccctcagttc agcagcttat tcctctccca agcttcgtgg aactctgccc 5701 ctgcctccct gtggtgagga actcaaaaaa ggacagggaa gtcgaggtgg gaagggtggc 5761 tcaaagtaaa gctgagctaa gcctagctcc ctcttctcct cttcacccca ccagaggcca 5821 gggagtgtgt caactgcgga gcaacagcca ctccactgtg gcggagggac aggacaggcc 5881 actatctatg caatgcctgc ggcctctatc acaagatgaa tgggcagaac aggcccctca 5941 tccggcccaa gaagcgcctg gtaagaccac agacctgctt caccatatac acaggaaccc 6001 ctgtcctcat cctgtacaag cagccacctc ttctcattgt ataggaacgc tgttctcatg 6061 cttacaggaa gctactgcag gccaccctgt acaggaaccc ctgtcccaat atcacatgga 6121 aacctttgcc ttcatcctac acaggaaaat ttcattccgt atggaaatcc ttgccctcac 6181 cctacataag aactcttgcc ctttccctgt acaggaaccc agatctatag aagctcctat 6241 cctcaaccta ccctcatagg cttcaaacag aaacctttgt cctcaccctg tacaccagtg 6301 ttttcgtcct tggcttcaca ctggaatcac cccaggtccc ttctcccaga tgtgctgatg 6361 gaattggact ggggtgctgc ctgggcatca ggatatttta gctccccaag tgattctgat 6421 acacagccaa agctgagaac caccaatgta gtctcccata gaatacagaa caaaagaacc 6481 ttataccaga caggaacccc tgtactggcc ccatacagaa tcctcaggca tcaccttgta 6541 cacaaagccc tgccttcaac ctgaccctca cttcttgggt cctcctgaca tcccctgtga 6601 gcctcttacc cccacttcca catccccagg gcactgatct cacatcctgt tgtcccctag 6661 attgtcagca aacgggcagg tactcagtgc accaactgcc agacgaccac cacgacactg 6721 tggcggagaa atgccagcgg ggaccccgtg tgcaatgcct gcggcctcta ctacaagcta 6781 caccaggtga caccctgccc cttggagcca cccctctgct ttccctgtct tcatgccaca 6841 ctgccccgga ctctgttcct gttccacctt ctctctttcc ccacaaccct ctttcttctt 6901 tccccttcct cctttccctc cttcctcccc attccctcat ctcttctact ttccccctgc 6961 tctgttatta ccctccctcc ttccttctcc attccctcct cctccaccct cctccctcct 7021 ctcctctccc ctacttccag gcatcagtta atgtccaccc ccctgtccta gaccttgggc 7081 agctcctatc agccttggag gctttcccaa gctcagggtc tcacaacctc aggattcctt 7141 aagacaatct ctgcacccca aaattatctt accctgaagg aagtggggta gagagggtgt 7201 ccctggttga gacacagaga tgcaaaggtc tggagttggg gacacccgca gcctcccttt 7261 tggcaggtga accggccact gaccatgcgg aaggatggta ttcagactcg aaaccgcaag 7321 gcatctggaa aagggaaaaa gaaacggggc tctagtctgg gaggcacagg agcagccgaa 7381 ggaccagctg gtggctttat ggtggtggct gggggcagtg gtagcgggaa ttgtggggag 7441 gtgacttcag gcctgacact gggcccccca ggtactgccc atctctacca aggcctgggc 7501 cctgtggtgc tgtcagggcc tgttagccac ctcatgcctt tccctggacc cctgctgggc 7561 tcacccacag gctccttccc cacaggcccc atgcccccca ccaccagcgc tactgtggtg 7621 gctccgctca gctcgtgagg gcacagagca cggcctccag aggaggggtg gtgtcctcct 7681 cctcttgtag ccagctgtct ggacaaccca agtctctggg ccctaggcac cccatggcct 7741 gaaccttcaa agcttttgta aaataaaacc atcaaagtcc tgaaa //
Whole sequence (abbreviated view) Selected region from: to:
All features Gene, RNA, and CDS features only
Show reverse complement Show gap features
Your browsing activity is empty.
Activity recording is turned off.
Turn recording back on