JavaScript is disabled in your web browser

You must have JavaScript enabled in your web browser to use the Genome Browser

UCSC Genome Browser Acknowledgments

The data and software displayed on this site are the result of a large collaborative effort among many individuals at UCSC and at research institutions around the world. We'd like to acknowledge the scientists and engineers who have contributed to the UCSC Genome Browser and its data, and the organizations who have made this project possible through their generous funding.

Human

Deuterostomes

C. intestinalis
Lancelet
S. purpuratus

Insects

A. gambiae
A. mellifera
D. ananassae
D. erecta
D. grimshawi
D. melanogaster
D. mojavensis
D. persimilis
D. pseudoobscura
D. sechellia
D. simulans
D. virilis
D. yakuba

Nematodes

C. brenneri
C. briggsae
C. elegans
C. japonica
C. remanei
P. pacificus

Other

Yeast
California sea hare

Viruses

Ebola

Acknowledgment of early support

UCSC Genome Browser project team

Human Genome Browser acknowledgments

The UCSC Human Genome Browser is generated by the UCSC Genome Bioinformatics Group in collaboration with the International Human Genome Project. The browser project is primarily funded by grant U24HG002371 from the National Human Genome Research Institute.

We work with collaborators at many other institutions to produce and annotate this reference sequence of the human genome. To view complete acknowledgments for a specific annotation, see the "Credits" section on the track's description page in the Genome Browser. For a list of browser-related publications, see our publications page.

UCSC Human Genome Browsers:

hs1 (T2T CHM13v2.0): Hiram Clawson, Brian Raney, Jairo Navarro
hg38 (GRCh38): Hiram Clawson, Brian Raney, Robert Kuhn, Donna Karolchik, Steve Heitner
hg19 (GRCh37): Hiram Clawson, Brooke Rhead, Pauline Fujita, Ann Zweig, Katrina Learned, Donna Karolchik and Robert Kuhn
hg18: Engineering effort led by Fan Hsu; QA effort led by Ann Zweig
hg17: Hiram Clawson, Terry Furey, Heather Trumbower, Robert Kuhn, Donna Karolchik, Kate Rosenbloom, Angie Hinrichs, Rachel Harte, and Jim Kent
hg16: Terry Furey, Hiram Clawson, Heather Trumbower, Mark Diekhans, Robert Baertsch, Donna Karolchik, Jim Kent
hg15 (archived): Jim Kent, Terry Furey, Matt Schwartz, Heather Trumbower, Angie Hinrichs, Fan Hsu, Donna Karolchik, Jorge Garcia, Patrick Gavin, Chuck Sugnet, Yontao Lu, Mark Diekhans, Ryan Weber, Robert Baertsch, Krishna Roskin, and many other students in the UCSC Genome Bioinformatics group
Earlier browsers (archived): UCSC Genome Bioinformatics staff and students

Human genome data sources

Recent versions of the human genome sequence, starting with the hg19 (GRCh37) assembly, are provided by the Genome Reference Consortium (GRC), whose goal is to correct the small number of regions in the reference that are currently misrepresented, to close as many remaining gaps as possible and to produce alternative assemblies of structurally variant loci when necessary.

Prior to hg19, the human genome sequence data used by the browser were generated by laboratories belonging to the Human Genome Sequencing Consortium. Periodic freezes of this data were generated by NCBI, and EST, mRNA, BACend and other types of data used for the assembly and annotation were obtained from GenBank, European Molecular Biology Lab (EMBL), and DNA Data Bank of Japan (DDBJ). Clone maps were created by the individual sequencing centers.

The original chromosome files were produced by the institutions and individuals listed in the table below.

Chromosome	Sequencing Centers	AGP File
1	The Sanger Institute, Cambridge, UK University of Washington, Department of Genome Sciences, Seattle, WA, USA	Jane Rogers
2	The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA	Rick Wilson
3	Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA Beijing Genomics Institute/Human Genome Center, Beijing, China University of Washington, Department of Genome Sciences, Seattle, WA, USA	Steve Scherer
4	The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA Stanford Human Genome Center, Palo Alto, CA, USA	Rick Wilson
5	Joint Genome Institute, U.S. Department of Energy, Walnut Creek, CA, USA Stanford Human Genome Center, Palo Alto, CA, USA	Jeremy Schmutz
6	The Wellcome Trust Sanger Institute, Cambridge, UK	Jane Rogers
7	The Genome Institute at Washington University (WUSTL), St. Louis, MO USA University of Washington, Department of Genome Sciences, Seattle, WA, USA	Rick Wilson
8	The Broad Institute, Cambridge, MA USA Keio University, Tokyo, Japan Institute of Molecular Biology (IMB), Jena, Germany	Chad Nusbaum
9	The Wellcome Trust Sanger Institute, Cambridge, UK Gesellschaft fuer Biotechnologische Forschung (GBF), Braunschweig, Germany	Jane Rogers
10	The Wellcome Trust Sanger Institute, Cambridge, UK Genome Therapeutics Corporation (GTC), Waltham, MA, USA	Jane Rogers
11	RIKEN Human Genome Research Group, Japan The Broad Institute, Cambridge, MA USA	Todd Taylor
12	Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA	Steve Scherer
13	The Wellcome Trust Sanger Institute, Cambridge, UK	Jane Rogers
14	Genoscope National Sequencing Centre, Evry, France Institute for Systems Biology (ISB), Seattle, WA The Genome Institute at Washington University (WUSTL), St. Louis, MO USA RIKEN Human Genome Research Group, Japan	Jean Weissenbach
15	The Broad Institute, Cambridge, MA USA Institute of Molecular Biology (IMB) Institute for Systems Biology (ISB), Seattle, WA	Chad Nusbaum
16	Joint Genome Institute (JGI), U.S. Department of Energy, Walnut Creek, CA, USA Stanford Human Genome Center, Palo Alto, CA, USA The Institute for Genomic Research (TIGR), Rockville, MD, USA The Wellcome Trust Sanger Institute, Cambridge, UK	Jeremy Schmutz
17	The Broad Institute, Cambridge, MA USA	Chad Nusbaum
18	The Broad Institute, Cambridge, MA USA RIKEN Human Genome Research Group, Japan	Chad Nusbaum
19	Joint Genome Institute (JGI), U.S. Department of Energy, Walnut Creek, CA, USA Stanford Human Genome Center, Palo Alto, CA, USA	Jeremy Schmutz
20	The Wellcome Trust Sanger Institute, Cambridge, UK	James Gilbert
21	RIKEN Human Genome Research Group, Japan Max Planck Institute for Molecular Genetics, Berlin, Germany Gesellschaft fuer Biotechnologische Forschung (GBF), Braunschweig, Germany Keio University School of Medicine, Tokyo, Japan Chromosome 21 Consortium	Todd Taylor
22	The Wellcome Trust Sanger Institute, Cambridge, UK The University of Oklahoma, Norman, OK USA Keio University School of Medicine, Tokyo, Japan	Jane Rogers
X	The Wellcome Trust Sanger Institute, Cambridge, UK Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA Institute of Molecular Biology (IMB), Jena, Germany The Genome Institute at Washington University (WUSTL), St. Louis, MO USA Max Planck Institute for Molecular Genetics, Berlin, Germany	Jane Rogers
Y	The Genome Institute at Washington University (WUSTL), St. Louis, MO USA	Rick Wilson

Alpaca genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Alpaca Genome Browser/Annotations:

vicPac2: Hiram Clawson and Luvina Guruvadoo
vicPac1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The alpaca sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Armadillo genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Armadillo Genome Browser/Annotations:

dasNov3: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The armadillo sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Baboon genome

Funding: National Human Genome Research Institute (NHGRI), USA

Sequencing/Assembly: Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA

UCSC Baboon Genome Browser/Annotations:

papAnu4: Hiram Clawson and Lou Nassar
papAnu2: Chin Li and Luvina Guruvadoo
papHam1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The baboon sequence is made freely available for public use by the Baboon Genome Sequencing Consortium. Please review the BCM-HGSC conditions for use guidelines before using this data.

Bison genome

Sequencing/Assembly: University of Maryland, College Park, MD, USA

UCSC Bison Genome Browser/Annotations:

bisBis1: Hiram Clawson and Chris Villarreal

Data use

The bison sequence is made freely available by the University of Maryland Genome Assembly Group, Aleksey Zimin.

Bonobo genome

Funding: Italian grant Futuro in Ricerca 2010-RBFR103CE3 and the Howard Hughes Medical Institute

Sequencing/Assembly:

Eichler Laboratory, Department of Genome Sciences, University of Washington, Seattle
Mario Ventura Università degli studi di Bari Aldo Moro

UCSC Bonobo Genome Browser/Annotations:

panPan3: Hiram Clawson and Lou Nassar

Data use

The bonobo sequence is made freely available by the University of Bari and University of Washington.

Funding: European Research Council (grant 233297, TWOPAN) and the Max Planck Society

Sequencing/Assembly: Max-Planck Institute for Evolutionary Anthropology, Germany

UCSC Bonobo Genome Browser/Annotations:

panPan2: Hiram Clawson and Jairo Navarro
panPan1: Chin Li, Hiram Clawson, and Jonathan Casper

Data use

The bonobo sequence is made freely available by the Max-Planck Institute for Evolutionary Anthropology. The initial analysis of this dataset can be found in Prüfer K et al. The bonobo genome compared with the chimpanzee and human genomes. Nature. 2012 Jun 28;486(7404):527-31. PMID: 22722832

Brown kiwi genome

Sequencing/Assembly: Max-Planck Institute for Evolutionary Anthropology, Germany

UCSC Brown Kiwi Genome Browser/Annotations:

aptMan1: Chris Eisenhart and Luvina Guruvadoo

Data use

The brown kiwi sequence is made freely available by the Max-Planck Institute for Evolutionary Anthropology. The initial analysis of this dataset can be found in Le Duc D et al. Kiwi genome provides insights into evolution of a nocturnal lifestyle. Genome Biol. 2015 Jul 23;16:147. PMID: 26201466

Bushbaby genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Bushbaby Genome Browser/Annotations:

otoGar3: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The bushbaby sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Cat genome

The UCSC felCat9 Genome Browser displays data produced by the Genome Sequencing Center (GSC) at Washington University (WashU) School of Medicine:

felCat9 browser: Nov. 2017 Felis_catus_9.0 assembly of the domestic cat

Sequencing/Assembly:

Genome Sequencing Center (GSC) at Washington University (WashU) School of Medicine, St. Louis, MO, USA

UCSC Cat Genome Browser/Annotations (felCat9): Hiram Clawson and Conner Powell

The UCSC felCat8 and felCat5 Genome Browsers display data produced by the International Cat Genome Sequencing Consortium:

felCat8 browser: Nov. 2014 Felis_catus_8.0 assembly of the domestic cat
felCat5 browser: Sep. 2011 Felis_catus-6.2 assembly of the domestic cat

Funding: National Human Genome Research Institute (NHGRI), Bethesda, MD, USA

454 Production Sequencing: The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA

BAC sequencing: The Genome Institute at WUSTL

Plasmid and fosmid sequencing: The Broad Institute, Cambridge, MA, USA and Agencourt Bioscience

Sequence assembly and data integration for creation of chromosomal AGP files: The Genome Institute at WUSTL

Cat RH map: Bill Murphy, Texas A&M University, College Station, TX, USA.

DNA source: Dr. Kristina Narfstrom, University of Missouri, Columbia, MO, USA.

UCSC Cat Genome Browser/Annotations:

felCat8: Hiram Clawson, Brian Raney, and Christopher Lee
felCat5: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

The UCSC felCat4 Genome Browser displays data from the Dec. 2008 catChrV17e draft assembly, based on sequence reads from six domestic cats and one wild cat.

Sequencing/Assembly:

NHGRI Genome Technology Branch and NIH Intramural Sequencing Center, Bethesda, MD, USA
Agencourt Bioscience, Danvers, MA, USA
NCI Laboratory of Genomic Diversity, Frederick, MD, USA
Hill's Pet Nutrition, Inc., Topeka, KS, USA

UCSC Cat Genome Browser/Annotations (felCat4): UCSC Genome Bioinformatics Group, Santa Cruz, CA, USA - Chin Li, Hiram Clawson, Antonio Coelho, Mary Goldman, and Donna Karolchik

The UCSC felCat3 Genome Browser displays data from the March 2006 v3 draft assembly of the domestic cat (Felis catus).

Funding: National Human Genome Research Institute (NHGRI), Bethesda, MD, USA

Project Leadership: The Broad Institute, Cambridge, MA, USA

Sequencing/Assembly: The Broad Institute - team led by Kerstin Lindblad-Toh

Plasmid and fosmid libraries: The Broad Institute

BAC end sequencing: Agencourt Bioscience, Danvers, MA, USA

UCSC Cat Genome Browser/Annotations (felCat3): Heather Trumbower, Angie Hinrichs, Mark Diekhans, Brooke Rhead, and Archana Thakkapallayil

Data use

The catChrV17e cat assembly was initially published in Pontius JU et al. Initial sequence and comparative analysis of the cat genome. Genome Res. 2007 Nov;17(11):1675-89. PMID: 17975172; PMCID: PMC2045150

The Broad Institute released the initial 2X (v3) assembly of the cat genome as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The cat sequence is made freely available with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Chimpanzee genome

The UCSC Chimpanzee Genome Browsers display whole-genome assembly data produced by the University of Washington:

Jan. 2018 panTro6 browser: Clint_PTRv2 124x PacBio; Illumina NextSeq 500; BioNano Saphyr (two enzyme)

and draft assembly data produced by the Chimpanzee Genome Sequencing Consortium:

May 2016 panTro5 browser: v3.0 6x Sanger; 55x Illumina; 9x PacBio
Feb. 2011 panTro4 browser: v2.1.4 chromosome-based 6X draft assembly
Oct. 2010 panTro3 browser: v2.1.3 chromosome-based 6X draft assembly
Mar. 2006 panTro2 browser: v2.1 chromosome-based 6X draft assembly (Oct. 2005)
Nov. 2003 panTro1 browser: v1.1 Arachne draft assembly

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly (panTro6):

The whole-genome assembly of a male chimpanzee, Clint (Yerkes pedigree number C0471) was done in stages, resulting in three progressive versions:

The backbone of the assembly is made up of contigs obtained by assembling the PacBio (P6-C4 chemistry) reads using the Falcon assembler. These contigs are error-corrected using the tool Quiver and then further error-corrected using Pilon with 37-fold Illumina paired-end reads.
BioNano optical maps are used to place the contigs into scaffolds, followed by chromosome-level scaffolding using bicolor FISH.
The assembly is further error-corrected, particularly in regions that have clustered 1-2 bp indels. This is done by applying an Eichler-lab-created Freebayes-based indel correction method.

Sequencing/Assembly (panTro5):

BAC sequencing - McDonnell Genome Institute at Washington University School of Medicine, St Louis, MO. Sequence generation.
Assembly and data integration for creation of chromosomal AGP files:
- Lukas Kuderna and Tomas Marques-Bonet, ICREA at Institut de Biologia Evolutiva, (UPF-CSIC), PRBB, 08003 Barcelona, Spain, and LaDeana Hillier, McDonnell Genome Institute at Washington University School of Medicine, St. Louis, MO, USA
- Wes Warren, Lars Feuk (Uppsala U), Andrew Sharp (Mt Sinai), Ed Green (Dovetail), Mikkel Schierup (Aarhus U), and Bojan Obradovic (Illumina).

Sequencing/Assembly (panTro2, panTro3, panTro4):

Whole genome shotgun (WGS) sequence data assembly and organization: The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA
Underlying WGS data: The Genome Institute at WUSTL and The Broad Institute, Cambridge, MA USA
5-Mb region of chromosome 7 at chr7:84674857-89461887): finished by The Genome Institute at WUSTL
Chromosome Y sequence: finished by The Genome Institute at WUSTL, with detailed mapping and extensive collaboration with David Page's group at the Whitehead Institute. For more information, see:
- Hughes JF et al. Conservation of Y-linked genes during human evolution revealed by comparative sequencing in chimpanzee. Nature. 2005 Sep 1;437(7055):100-3. PMID: 16136134
- "The DNA Sequence of Chimpanzee Chromosome Y", unpublished.
Chromosome 21 sequence: Todd Taylor and the RIKEN Genomic Sciences Center. See Watanabe et al. DNA sequence and comparative analysis of chimpanzee chromosome 22. Nature. 2004 May 27;429(6990):382-8. PMID: 15164055

Sequencing/Assembly (panTro1): Sequence assembled by teams led by Eric Lander, Ph.D. at The Broad Institute and Richard K. Wilson, Ph.D. at The Genome Institute at WUSTL

Alignments: LaDeana Hillier, The Genome Institute at WUSTL and The Broad Institute

Annotations:

The Genome Institute at WUSTL, St. Louis, MO, USA
Research Group on Biomedical Informatics of IMIM/UPF (GRIB), Barcelona, Spain
Riken Genomic Sciences Center, Yokohama City, Japan
UCSC Genome Bioinformatics Group, Santa Cruz, CA, USA

UCSC Chimp Genome Browser:

panTro6: Hiram Clawson and Jairo Navarro
panTro5: Hiram Clawson and Chris Villarreal
panTro4: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik
panTro3: Hiram Clawson, Greg Roe, and Donna Karolchik
panTro2: Kate Rosenbloom, Brian Raney, Hiram Clawson, Ann Zweig, Archana Thakkapallayil, and Donna Karolchik
panTro1: Kate Rosenbloom, Jim Kent, Hiram Clawson, Heather Trumbower, Robert Kuhn, and Donna Karolchik

Data use

The initial sequencing and analysis of the chimpanzee genome was published by the Chimpanzee Sequencing and Analysis Consortium. The chimpanzee sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite this publication when using these data:

The Chimpanzee Sequencing and Analysis Consortium. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005 Sep 1;437(7055):69-87. PMID: 16136131

Chinese hamster genome

Sequencing/Assembly: Eagle Genomics Ltd

UCSC Chinese Hamster Genome Browser/Annotations:

criGriChoV2: Hiram Clawson and Daniel Schmelter
criGri1: Hiram Clawson, Brian Raney, and Steve Heitner
criGriChoV1: Hiram Clawson, Max Haeussler, Matt Speir, and Cath Tyner

Data use

The Chinese hamster sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite this publication when using these data:

Lewis NE et al. Genomic landscapes of Chinese hamster ovary cell lines as revealed by the Cricetulus griseus draft genome. Nat Biotechnol. 2013 Aug;31(8):759-65.

Chinese pangolin genome

Funding: NIH-NHGRI grant 5U54HG00307907 to RKW, Director of The Genome Institute at Washington University

DNA source: Dr. Stephen O'Brien at St. Petersburg State University

Sequencing/Assembly: Genome Institute, Washington University School of Medicine, St. Louis, MO, USA

UCSC Chinese Pangolin Genome Browser/Annotations:

manPen1: Hiram Clawson, Cath Tyner

Data use

The Chinese pangolin sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged: Dr. Richard K. Wilson and The Genome Institute, Washington University School of Medicine.

Cow genome

The UCSC Cow Genome Browser displays three different assemblies of the bovine genome: one provided by the University of Maryland Center for Bioinformatics and Computational Biology (UMD CBCB), one from the Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), and a third one from the USDA Agricultural Research Service (USDA ARS). The assemblies available in the browser include:

Apr. 2018 (bosTau9) assembly - ARS-UCD1.2
Jun. 2014 (bosTau8) assembly - UMD CBCB version UMD_3.1.1
Oct. 2011 (bosTau7) assembly - Bovine Genome Sequencing Consortium (BCM-HGSC) version Btau_4.6.1
Nov. 2009 (bosTau6) assembly - UMD CBCB version UMD_3.1
Oct. 2007 (bosTau4) assembly - BCM-HGSC version Btau_4.0
Aug. 2006 (bosTau3) assembly - BCM-HGSC version Btau_3.1
Mar. 2005 (bosTau2) assembly - BCM-HGSC version Btau_2.0
Sep. 2004 (bosTau1) assembly - BCM-HGSC version Btau_1.0

The UMD CBCB genome assemblies are provided with the following acknowledgments:

Genome sequence: NCBI Trace Archive

Genome assembly: University of Maryland Center for Bioinformatics and Computational Biology, College Park, MD, USA

UCSC Cow Genome Browser/Annotations:

bosTau9: Hiram Clawson and Lou Nassar
bosTau8: Steve Heitner and Luvina Guruvadoo
bosTau6: Hiram Clawson, Chin Li, Brooke Rhead, Greg Roe, Steve Heitner, and Donna Karolchik

The BCM-HGSC genome assemblies are provided with the following acknowledgments:

Funding - For a list of the many agencies that funded the cow sequencing project, see the BCM-HGSC Bovine Genome Project page.

BAC library DNA: Dr. Michael MacNeil's laboratory at the USDA Agricultural Research Service, Miles City, MT, USA

Whole genome shotgun sequence DNA: Dr. Timothy Smith's laboratory at the U.S. Meat Animal Research Center, Clay Center, NE, USA

BAC-based fingerprint map: Genome Sciences Centre, Vancouver, B.C.

UCSC Cow Genome Browser/Annotations:

bosTau7: Chin Li, Hiram Clawson, Brian Raney and Steve Heitner
bosTau4: Hiram Clawson, Brian Raney, and Ann Zweig
bosTau3: Heather Trumbower, Angie Hinrichs, Kayla Smith and Donna Karolchik

bosTau2

bosTau1 (archived): Heather Trumbower, Jim Kent, Hiram Clawson, Angie Hinrichs, Brian Raney, Mark Diekhans, Robert Kuhn, Ali Sultan-Qurraie and Donna Karolchik

Data use

The cow sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite the appropriate publication when using these data.

UMD cow assembly: Zimin AV, Delcher AL, Florea L, Kelley DR, Schatz MC, Puiu D, Hanrahan F, Pertea G, Van Tassell CP, Sonstegard TS et al. A whole-genome assembly of the domestic cow, Bos taurus. Genome Biol. 2009;10(4):R42.

Baylor cow assembly: Bovine Genome Sequencing and Analysis Consortium. The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science. 2009 Apr 24;324(5926):522-8.

For more information see the UMD CBCB and the BCM-HGSC Bovine Genome Project pages.

Crab-eating macaque genome

Funding: National Human Genome Research Institute (NHGRI)

DNA source: Dr. Jay Kaplan, Wake Forest Primate Facility, Wake Forest, NC

Genome Sequence: Genome Institute at Washington University (WUSTL), St. Louis, MO, USA

Sequence Assembly: Richa Agarwala, Sergey Shiryaev, NCBI and The Genome Institute, Washington University School of Medicine, St. Louis, MO, USA

Assembly curation: LaDeana Hillier, Genome Institute, Washington University School of Medicine, St. Louis, MO, USA

FISH mapping data: Mariano Rocchi, Department of Biology, University of Bari, Bari, Italy

UCSC Crab-eating Macaque Genome Browser/Initial Annotations:

macFas5: Hiram Clawson, Brian Raney and Steve Heitner

Data use

The crab-eating macaque data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Dog genome

canFam5 submitter: The University of Michigan.

Funding: The National Human Genome Research Institute

Project Leadership: The Broad Institute, Cambridge, MA, USA

Sequencing/Assembly: The Broad Institute - team led by Kerstin Lindblad-Toh, Ph.D.

Plasmid and fosmid libraries: The Broad Institute

BAC libraries: Children's Hospital Oakland Research Institute (CHORI), Oakland, CA, USA

BAC end sequencing: Agencourt Bioscience, Danvers, MA, USA

RH map: Centre National de la Recherche Scientifique (CNRS), France

FISH mapping: North Carolina State University (NCSU), Raleigh, NC, USA

Dog Selection: Fred Hutchinson Cancer Research Center (FHCRC), Seattle, WA, USA

UCSC Dog Genome Browser/Annotations:

canFam6: Hiram Clawson and Jairo Navarro
canFam5: Hiram Clawson and Lou Nassar
canFam4: Hiram Clawson and Lou Nassar
canFam3: Hiram Clawson and Luvina Guruvadoo
canFam2: Angie Hinrichs, Jennifer Jackson, and Donna Karolchik
canFam1: Angie Hinrichs, Robert Kuhn and Donna Karolchik

Data use

The dog genome sequence is made freely available by the Dog Genome Sequencing Project. Please cite this publication when using these data:

Lindblad-Toh K, Wade CM, Mikkelsen TS, Karlsson EK, Jaffe DB, Kamal M, Clamp M, Chang JL, Kulbokas EJ 3rd, Zody MC et al. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature. 2005 Dec 8;438(7069):803-19. PMID: 16341006

Dolphin genome

Sequencing: Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA

Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Dolphin Genome Browser/Annotations:

turTru2: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The dolphin sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The dolphin data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Elephant genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Elephant Genome Browser/Annotations:

loxAfr3: Hiram Clawson, Pauline Fujita, Vanessa Swing, Antonio Coelho, and Donna Karolchik

Data use

The elephant sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Ferret genome

Funding: National Institute of Allergy and Infectious Diseases (NIAID), Genomic Sequencing Centers for Infectious Diseases (GSCID) program

Sequencing/Assembly: Ferret Genome Sequencing Consortium and The Broad Institute, Cambridge, MA, USA

UCSC Ferret Genome Browser/Annotations:

musFur1: Hiram Clawson, Brian Lee, and Brooke Rhead

Data use

The ferret sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite this publication when using these data:

Peng X, Alföldi J, Gori K, Eisfeld AJ, Tyler SR, Tisoncik-Go J, Brawand D, Law GL, Skunca N, Hatta M et al. The draft genome sequence of the ferret (Mustela putorius furo) facilitates study of human respiratory disease. Nat Biotechnol. 2014 Dec;32(12):1250-5.

Garter snake genome

Sequencing/Assembly: The Genome Institute at Washington University School of Medicine

UCSC Garter Snake Genome Browser/Annotations:

thaSir1: Jonathan Casper, Conner Powell

Data use

The garter snake sequence is made freely available to the community by the Genome Sequencing Center, Washington University School of Medicine, with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Gibbon genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly:

Gibbon Genome Sequencing Consortium
The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA
Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA

UCSC Gibbon Genome Browser/Annotations:

nomLeu3: Pauline Fujita and Hiram Clawson
nomLeu2: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik
nomLeu1: Chin Li, Brooke Rhead, Steve Heitner, Lauren Scott and Donna Karolchik

Data use

The gibbon sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite this publication when using these data:

Carbone L, Harris RA, Gnerre S, Veeramah KR, Lorente-Galdos B, Huddleston J, Meyer TJ, Herrero J, Roos C, Aken B et al. Gibbon genome and the fast karyotype evolution of small apes. Nature. 2014 Sep 11;513(7517):195-201.

Golden eagle genome

Sequencing/Assembly: The Genome Institute at Washington University School of Medicine

UCSC Golden eagle Genome Browser/Annotations:

aquChr2: Hiram Clawson, Jairo Navarro

Data use

The golden eagle sequence is made freely available to the community by the Genome Sequencing Center, Washington University School of Medicine, with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Golden snub-nosed monkey genome

Sequencing/Assembly: Novogene

UCSC Golden Snub-Nosed Monkey Genome Browser/Annotations:

rhiRox1: Hiram Clawson, Luvina Guruvadoo

Data use

The golden snub-nosed monkey sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite this publication when using these data:

Zhou X, Wang B, Pan Q, Zhang J, Kumar S, Sun X, Liu Z, Pan H, Lin Y, Liu G et al. Whole-genome sequencing of the snub-nosed monkey provides insights into folivory and evolutionary history. Nat Genet. 2014 Dec;46(12):1303-10. PMID: 25362486

Gorilla genome

The UCSC gorGor6 Genome Browser displays data from the Aug 2019 assembly of from the University of Washington.

Sequencing/Assembly: University of Washington

UCSC Gorilla Genome Browser/Annotations:

gorGor6: Hiram Clawson and Daniel Schmelter

The UCSC gorGor5 Genome Browser displays data from the Mar 2016 assembly of Gorilla gorilla gorilla produced by the University of Washington with the following acknowledgments:

Sequencing/Assembly: University of Washington

UCSC Gorilla Genome Browser/Annotations:

gorGor5: Hiram Clawson and Jairo Navarro

The UCSC gorGor4 and gorGor3 Genome Browsers display data produced by the Wellcome Trust Sanger Institute with the following acknowledgments:

Sequencing/Assembly:

The Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
The European Bioinformatics Institute (EBI), Hinxton, Cambridge, UK

UCSC Gorilla Genome Browser/Annotations:

gorGor4: Jonathan Casper, Brian Raney, Christopher Lee
gorGor3: Hiram Clawson, Mark Diekhans, Brian Raney, Brooke Rhead, Steve Heitner, Lauren Scott and Donna Karolchik

Data use

The gorilla sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite this publication when using these data:

Scally A, Dutheil JY, Hillier LW, Jordan GE, Goodhead I, Herrero J, Hobolth A, Lappalainen T, Mailund T, Marques-Bonet T et al. Insights into hominid evolution from the gorilla genome sequence. Nature. 2012 Mar 7;483(7388):169-75. PMID: 22398555; PMCID: PMC3303130

Additionally, this publication discusses the gorGor5 assembly:

Gordon D, Huddleston J, Chaisson MJ, Hill CM, Kronenberg ZN, Munson KM, Malig M, Raja A, Fiddes I, Hillier LW et al. Long-read sequence assembly of the gorilla genome. Science. 2016 Apr 1;352(6281):aae0344. PMID: 27034376; PMCID: PMC4920363

Green monkey genome

The green monkey assembly is provided by the Vervet Genomics Consortium with the following acknowledgments:

Funding: National Human Genome Research Institute (NHGRI)

DNA source: Dr. Jay Kaplan, Wake Forest Primate Facility, Wake Forest, NC, USA

Genome Sequence:

Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
Department of Human Genetics, McGill University, Montreal, Canada

BAC Library: Dr. Pieter DeJong, CHORI, Oakland, CA, USA

Assembly Curation:

Jessica Wasserscheid, Nikoleta Juretic, Dr. Ken Dewar - McGill University, Montreal, QC Canada
LaDeana Hillier - Genome Institute, Washington University School of Medicine, St. Louis, MO, USA

FISH Mapping Data: Mariano Rocchi, Department of Biology, University of Bari, Bari, Italy

cDNA data - RNA sources: Dr. Nelson Freimer - Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, CA, USA

UCSC Green Monkey Genome Browser/Annotations:

chlSab2: Hiram Clawson, Brian Raney, Christopher Lee

Data use

The green monkey sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Guinea pig genome

Funding: National Human Genome Research Institute

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Guinea Pig Genome Browser/Annotations:

cavPor3: Tim Dreszer, Kate Rosenbloom, Hiram Clawson, Kayla Smith, Robert Kuhn, and Donna Karolchik

Data use

The guinea pig sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Hawaiian monk seal genome

Sequencing/Assembly: Johns Hopkins University, Baltimore, MA, USA

UCSC Hawaiian monk seal Genome Browser/Annotations:

neoSch1: Hiram Clawson, Robert Kuhn, and Gerardo Perez

Data use

The Hawaiian monk seal sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged: Alan F. Scott at Johns Hopkins University.

Mohr DW, Naguib A, Weisenfeld NI, Kumar V, Shah P, Church DM, Jaffe D, Scott AF. Improved de Novo Genome Assembly: Linked-Read Sequencing Combined with Optical Mapping Produce a High Quality Mammalian Genome at Relatively Low Cost. BioRxiv. 2017 April 18 (preprint).

Hedgehog genome

Sequencing/Assembly: The Broad Institute at MIT and Harvard, Cambridge, MA, USA

DNA Samples: Dr. Neil J. Gemmell - Reproduction, Genomics and Development Group, University of Otago, Dunedin, New Zealand

UCSC Hedgehog Genome Browser/Annotations:

eriEur2: Hiram Clawson, Brian Raney and Steve Heitner
eriEur1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The hedgehog sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The hedgehog data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Horse genome

The UCSC equCab3 Genome Browser displays data produced by the University of Louisville:

equCab3 browser: Jan. 2018 University of Louisville EquCab3.0

Sequencing/Assembly: University of Louisville, Louisville, KY, USA

UCSC Horse Genome Browser/Annotations:

(equCab3): Hiram Clawson and Conner Powell

Funding: The National Human Genome Research Institute

Project Leadership: The Broad Institute, Cambridge, MA, USA

Sequencing/Assembly: The Broad Institute

BAC End Reads: University of Veterinary Medicine Hanover, Germany, and the Helmholtz Centre for Infection Research, Braunschweig, Germany

UCSC Horse Genome Browser/Annotations:

equCab2: Larry Meyer, Hiram Clawson, Pauline Fujita, Brooke Rhead, and Donna Karolchik
equCab1: Fan Hsu, Brooke Rhead, Robert Kuhn, Hiram Clawson, Angie Hinrichs, Kate Rosenbloom, and Donna Karolchik

Data use

The horse sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite this publication when using these data:

Theodore S. Kalbfleisch, Edward S. Rice, Michael S. DePriest Jr., Brian P. Walenz, Matthew S. Hestand et al. Improved reference genome for the domestic horse increases assembly contiguity and composition. Nature, Commincations Biology. 2018 Nov 16;197:2399-3642

Wade CM, Giulotto E, Sigurdsson S, Zoli M, Gnerre S, Imsland F, Lear TL, Adelson DL, Bailey E, Bellone RR et al. Genome sequence, comparative analysis, and population genetics of the domestic horse. Science. 2009 Nov 6;326(5954):865-7.

Kangaroo rat genome

Sequencing: Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA

Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Kangaroo Rat Genome Browser/Annotations:

dipOrd1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The kangaroo rat sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The kangaroo rat data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Malayan flying lemur genome

Funding: National Institutes of Health (NIH)

Assembly/Sequencing: The Genome Institute, Washington University School of Medicine, St. Louis, MO, USA

DNA Source: William Murphy - VIBS, Texas A&M University, College Station, TX, USA

UCSC Malayan Flying Lemur Genome Browser/ Annotations:

galVar1: Hiram Clawson and Luvina Guruvadoo

Data use

The Malayan flying lemur sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged: Richard K. Wilson and The Genome Institute, Washington University School of Medicine.

Manatee genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Manatee Genome Browser/Annotations:

triMan1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The manatee sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite this publication when using these data:

Foote AD, Liu Y, Thomas GW, Vinař T, Alföldi, Deng J, Dugan S, van Elk CE, Hunter ME, Joshi V et al. Convergent evolution of the genomes of marine mammals. Nat Genet. 2015 Mar;47(3):272-5. PMID: 25621460; PMCID: PMC4644735

Marmoset genome

The UCSC calJac4 Genome Browser displays data from the May 2020 assembly of white-tufted-ear marmoset produced by the McDonnell Genome Institute at Washington University with the following acknowledgments:

Sequencing and assembly: McDonnell Genome Institute at Washington University

DNA Source: Wisconsin Primate Research Center

UCSC Marmoset Genome Browser/Annotations:

calJac4: Hiram Clawson and Jairo Navarro

Sequencing and assembly: The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA and Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA

DNA Source: Dr. Suzette Tardif, Southwest National Primate Research Center, San Antonio, TX, USA

CHORI-259 BAC library: Dr. Pieter DeJong, Children's Hospital Oakland Research Institute, Oakland, CA, USA

UCSC Marmoset Genome Browser/Annotations:

calJac3: Hiram Clawson, Chin Li, Mark Diekhans, Mary Goldman, Vanessa Swing, and Donna Karolchik
calJac1: Hiram Clawson, Robert Kuhn, Pauline Fujita, Brooke Rhead, and Donna Karolchik

Data use

These data were produced by The Genome Institute at WUSTL and the BCM-HGSC and can be downloaded here.

The marmoset sequence data may be freely downloaded, used in analyses, and repackaged in databases if the provider is properly acknowledged. Please cite this publication when using these data:

Marmoset Genome Sequencing and Analysis Consortium. The common marmoset genome provides insight into primate biology and evolution. Nat Genet. 2014 Aug;46(8):850-7. PMID: 25038751; PMCID: PMC4138798

Additionally, this publication discusses the calJac4 assembly:

Wang W, Liu JY, Wang HF, Yang MY, Liu QY, Ding MX. The complete mitochondrial genome of white-tufted-ear marmoset, Callithrix jacchus (Primates: Callitrichinae). Mitochondrial DNA A DNA Mapp Seq Anal. 2016 May;27(3):1920-1. PMID: 25329280 doi: 10.3109/19401736.2014.971287

Megabat genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly:

Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA
The Broad Institute, Cambridge, MA, USA

UCSC Megabat Genome Browser/Annotations:

pteVam1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The megabat sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The megabat data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Little brown bat genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Little brown bat Genome Browser/Annotations:

myoLuc2: Chin Li, Brian Raney, Robert Kuhn, Steve Heitner, Brooke Rhead, Greg Roe and Donna Karolchik

Data use

The Little brown bat sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The Little brown bat data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Minke whale genome

Sequencing/Assembly: Korea Institute of Ocean Science and Technology, Ansan, Republic of Korea

UCSC Minke Whale Genome Browser/Annotations:

balAcu1: Hiram Clawsom and Matthew Speir

Data use

The minke whale genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Yim HS, Cho YS, Guang X, Kang SG, Jeong JY, Cha SS, Oh HM, Lee JH, Yang EC, Kwon KK et al. Minke whale genome and aquatic adaptation in cetaceans. Nat Genet. 2014 Jan;46(1):88-92. PMID: 24270359

Mouse genome

Starting with mm10 (GRCm38), the mouse genome assembly is now provided by the Genome Reference Consortium (GRC). See the GRC Mouse Genome web pages for acknowledgments.

UCSC Mouse Genome Browser (mm10): Hiram Clawson and Luvina Guruvadoo
UCSC Mouse Genome Browser (mm39): Hiram Clawson and Jairo Navarro

Prior to mm10/GRCm38, mouse assemblies were produced by the Mouse Genome Sequencing Consortium, with these acknowledgments:

Funding:

Sequencing:

Physical Map:

Mouse BAC libraries: Children's Hospital Oakland Research Institute (CHORI)
BAC fingerprint map: Genome Sciences Centre
BAC end sequences: The Institute of Genome Research (TIGR)
TPF production - Tina Graves, April Cook, Christine Burton, Darren Graham, Shannon Dugan, Donna Muzny

Assembly: Deanna Church, Richa Agarwala, Josh Cherry, Mike DiCuccio, Hsiu-Chuan Chen, Paul Kitts, Victor Sapojnikov

cDNA Sequencing:

UCSC Mouse Genome Browser:

mm9: Hiram Clawson, Archana Thakkapallayil, Robert Kuhn and Donna Karolchik
mm8: Hiram Clawson, Fan Hsu, Kayla Smith, Ann Zweig, Robert Kuhn, Brooke Rhead, Archana Thakkapallayil, and Donna Karolchik
mm7: Hiram Clawson, Fan Hsu, Ann Zweig, Kayla Smith, Robert Kuhn and Donna Karolchik
mm6 (archived): Hiram Clawson, Fan Hsu, Jennifer Jackson, Robert Kuhn, Ali Sultan-Qurraie, Heather Trumbower and Donna Karolchik

Annotations:

The initial 3X coverage of the mouse genome was produced by the Mouse Sequencing Consortium, with the following acknowledgments:

Project Oversight: Arthur Holden and Francis Collins

Funding:

Sequencing:

The Broad Institute, Cambridge, MA, USA
The Genome Institute at Washington University (WUSTL)
The Wellcome Trust Sanger Institute

Data use

The mouse genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Mouse Genome Sequencing Consortium. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002 Dec 5;420(6915):520-62. PMID: 12466850

For more information about mouse assemblies provided by the GRC, see the GRC mouse genome web pages. For information about assemblies prior to mm10/GRCm38, see the Wellcome Trust Sanger Institute Mouse Resource Portal.

Mouse lemur genome

Sequep/Assembly:

The Broad Institute at MIT and Harvard, Cambridge, MA, USA
Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA

UCSC Mouse Lemur Genome Browser/Annotations:

micMur2: Hiram Clawson, Brian Raney, Christopher Lee
micMur1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The mouse lemur sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The mouse lemur data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Naked mole-rat genome

Sequencing/Assembly: Beijing Genomics Institute, Shenzhen, China

UCSC Naked Mole-rat Genome Browser/Annotations:

hetGla2: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik
hetGla1: Chin Li and Luvina Guruvadoo

Data use

The naked mole-rat genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Keane M, Craig T, Alföldi J, Berlin AM, Johnson J, Seluanov A, Gorbunova V, Di Palma F, Lindblad-Toh K, Church GM et al. The Naked Mole Rat Genome Resource: facilitating analyses of cancer and longevity-related adaptations. Bioinformatics. 2014 Dec 15;30(24):3558-60.

Opossum genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

FISH Mapping: North Carolina State University (NCSU), Raleigh, NC, USA

UCSC Opossum Genome Browser/Annotations:

monDom5: Andy Pohl, Hiram Clawson, Brian Raney, Mark Diekhans, Brooke Rhead and Katrina Learned
monDom4: Hiram Clawson, Archana Thakkapallayil, Ann Zweig, Kayla Smith and Donna Karolchik
monDom1: Hiram Clawson, Galt Barber, Ali Sultan-Qurraie, Brian Raney and Donna Karolchik

Data use

The opossum genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Mikkelsen TS, Wakefield MJ, Aken B, Amemiya CT, Chang JL, Duke S, Garber M, Gentles AJ, Goodstadt L, Heger A et al. Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences. Nature. 2007 May 10;447(7141):167-77.

Orangutan genome

Sequencing:

The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA
Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA
University of Washington, Seattle, WA, USA

Sequence Assembly and Chromosomal Sequence/AGP Construction: The Genome Institute at WUSTL

DNA Source:

Dr. Greely Stones, Gladys Porter Zoo, Brownsville, TX, USA
Susie_PABv2, Eichler Lab, Seattle, WA, USA

BAC Library: Yuko Yoshonaga in Pieter de Jong's laboratory, Children's Hospital Oakland Research Institute, Oakland, CA, USA

Fosmid Library: The Genome Institute at WUSTL

Fingerprint Map: The Genome Institute at WUSTL

Cytogenetic Mapping and Human/Orangutan Breakpoint Analyses: Mariano Rocchi, Department of Genetics and Microbiology, University of Bari, Bari, Italy

Fosmid End Placement against the Human Genome (used for breakpoint/inversion analyses during AGP construction): Lin Chen and Evan Eichler, Department of Genome Sciences, University of Washington, Seattle, WA, USA

Funding: National Human Genome Research Institute (NHGRI), National Institutes of Health (NIH)

UCSC Orangutan Genome Browser/Annotations:

ponAbe3: Hiram Clawson, Conner Powell
ponAbe2: Hiram Clawson, Kayla Smith, Robert Kuhn, Ann Zweig, and Donna Karolchik

Data use

The orangutan genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Locke DP, Hillier LW, Warren WC, Worley KC, Nazareth LV, Muzny DM, Yang SP, Wang Z, Chinwalla AT, Minx P et al. Comparative and demographic analysis of orang-utan genomes. Nature. 2011 Jan 27;469(7331):529-33.

Kronenberg ZN, Fiddes IT, Gordon D, Murali S, Dutcher SK, Gage FH, Warren WC, Shendure J, Haussler D, Schneider VA, Cao H, Ventura M, Wilson RK, Paten B, Pollen A, Eichler EE, et al. High-resolution comparative analysis of great ape genomes. Science 2018 Jun 8;360(6393).

Panda genome

Sequencing/Assembly: The Beijing Genomics Institute at Shenzhen

UCSC Panda Genome Browser/Annotations:

ailMel1: Hiram Clawson, Ann Zweig, and Antonio Coelho

Data use

The panda genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Li R, Fan W, Tian G, Zhu H, He L, Cai J, Huang Q, Cai Q, Li B, Bai Y, Zhang Z et al. The sequence and de novo assembly of the giant panda genome. Nature. 2010 Jan 21;463(7279):311-7. PMID: 20010809

Pig genome

Sequencing/Assembly:

UCSC Pig Genome Browser/Annotations:

susScr11: Hiram Clawson, Cath Tyner
susScr3: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik
susScr2: Hiram Clawson, Brian Raney, Mark Diekhans, and Mary Goldman

Data use

The pig genome sequence has been released by the International Swine Genome Sequencing Consortium under the terms of the Toronto Statement.

The International Swine Genome Sequencing Consortium genome-wide analyses of the pig genome is described in: Uenishi H, Morozumi T, Toki D, Eguchi-Ogawa T, Rund LA, Schook LB. Large-scale sequencing based on full-length-enriched cDNA libraries in pigs: contribution to annotation of the pig genome draft sequence. BMC Genomics. 2012 Nov 15;13:581. PMID: 23150988

The initial marker paper: Archibald AL, Bolund L, Churcher C, Fredholm M, Groenen MA, Harlizius B, Lee KT, Milan D, Rogers J, Rothschild MF et al. Pig genome sequence--analysis and publication strategy. BMC Genomics. 2010 Jul 19;11:438. PMID: 20642822; PMCID: PMC3017778

Pika genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Pika Genome Browser/Annotations:

ochPri3: Hiram Clawson, Brian Raney, and Steve Heitner
ochPri2: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The pika sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The pika data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Platypus genome

Sequencing/Assembly: The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA

UCSC Platypus Genome Browser/Annotations:

ornAna2: Hiram Clawson, Brian Raney, and Cath Tyner
ornAna1: Angie Hinrichs, Kayla Smith, Robert Kuhn, Brian Raney, and Donna Karolchik

Data use

The platypus genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Warren WC, Hillier LW, Marshall Graves JA, Birney E, Ponting CP, Grützner F, Belov K, Miller W, Clarke L, Chinwalla AT et al. Genome analysis of the platypus reveals unique signatures of evolution. Nature. 2008 May 8;453(7192):175-83. PMID: 18464734

Proboscis monkey genome

Sequencing/Assembly: Proboscis Monkey Functional Genome Consortium

UCSC Proboscis Monkey Genome Browser/Annotations:

nasLar1: Hiram Clawson and Jairo Navarro

Data use

The proboscis monkey sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Rabbit genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Rabbit Genome Browser/Annotations:

oryCun2: Hiram Clawson and Antonio Coelho

Data use

The rabbit sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Rat genome

rn7

Sequencing/Assembly: Wellcome Sanger Institute, Hinxton, Cambridgeshire, UK

UCSC Rat Genome Browser/Annotations:

rn7: Hiram Clawson, Lou Nassar, and Gerardo Perez

rn6 and older

The rat genome assembly (Rattus norvegicus) is provided by the Rat Genome Sequencing Consortium with the following acknowledgments:

Project Coordination: Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC)

Funding:

National Human Genome Research Institute (NHGRI)
National Heart, Lung and Blood Institute (NHLBI)

Sequencing:

Assembly: BCM-HGSC ATLAS group - Rui Chen, James Durbin, Paul Havlak

Fingerprint Mapping: The British Columbia Cancer Agency Genome Sciences Center - Marco Marra, Jackie Schein

BAC End Sequencing: The Institute for Genomic Research - Shaying Zhao

BAC Library Construction and Distribution: Children's Hospital Oakland Research Institute (CHORI) - Pieter deJong

Rat Genome Database: Medical College of Wisconsin

UCSC Rat Genome Browser/Annotations:

rn6: Hiram Clawson, Brian Raney and Matthew Speir
rn5: Hiram Clawson, Brian Raney and Steve Heitner
rn4: Angie Hinrichs, Fan Hsu, Brooke Rhead, Archana Thakkapallayil, Kayla Smith, Ann Zweig, Robert Kuhn, and Donna Karolchik
rn3: Hiram Clawson, Heather Trumbower, Robert Kuhn, Yontao Lu, Terry Furey, Mark Diekhans, Robert Baertsch, Donna Karolchik, and Jim Kent

Data use

The rat genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite these publications when using these data:

Havlak P, Chen R, Durbin KJ, Egan A, Ren Y, Song XZ, Weinstock GM, Gibbs RA. The Atlas genome assembly system. Genome Res. 2004 Apr;14(4):721-32. PMID: 15060016

Rat Genome Sequencing Project Consortium. Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature. 2004 Apr 1;428(6982):493-521. PMID: 15057822

See the Baylor conditions of use web page for restrictions on the use of these data.

Rhesus genome

The UCSC Genome Browser displays multiple assemblies of the Rhesus macaque genome produced by different institutions:

rheMac10 Genome Browser: Feb. 2019 draft assembly of the rhesus monkey, version Mmul_10, from the Genome Institute at Washington University School of Medicine.
rheMac8 Genome Browser: Nov. 2015 draft assembly of the rhesus monkey, version Mmul_8.0.1, from the Baylor College of Medicine Human Genome Sequencing Center.
rheMac3 Genome Browser: Oct. 2010 draft assembly of the rhesus monkey, version CR_1.0 from Beijing Genomics Institute-Shenzhen (BGI), Shenzhen, China
rheMac2 Genome Browser: Jan. 2006 draft assembly of the rhesus monkey, version Mmul_051212 from the Macaca mulatta Genome Sequencing Consortium (MMGSC) led by the Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), in collaboration with the J. Craig Venter Science Foundation Joint Technology Center and The Genome Institute at Washington University (WUSTL)
rheMac1 Genome Browser: Jan. 2005 draft assembly of the rhesus monkey, version Mmul_0.1 from MMGSC (see above)

The Genome Institute at Washington University School of Medicine assembly is provided with the following acknowledgments:

Macaca mulatta Sequence and Assembly Credits: Sequencing Funds - This work was supported by the NHGRI grant No. 5U24HG00908103, HIGH QUALITY HUMAN AND NON-HUMAN PRIMATE GENOME ASSEMBLIES, at the McDonnell Genome Institute, Washington University School of Medicine DNA source - Coriell sample id: AG07107, MMU Genome Sequencing - The McDonnell Genome Institute, Washington University School of Medicine. Sequence Assembly - The McDonnell Genome Institute, Washington University School of Medicine. Sequence assembly curation - The McDonnell Genome Institute, Washington University School of Medicine; The University of Washington, Seattle, WA.

The BGI assembly is provided with the following acknowledgments:

Funding: Southern China Center for Innovative Pharmaceuticals (SCCIP), the National Natural Science Foundation of China, the Shenzhen Municipal Government of China, the Major State Basic Research Development Program of China, the National Basic Research Program of China, and the National Science and Technology Major Project of Key Drug Innovation and Development

Sequencing/Assembly: BGI, Shenzhen, China

The MMGSC assembly is provided with the following acknowledgments:

Funding: National Human Genome Research Institute (NHGRI), USA

Sequencing/Assembly: BCM-HGSC, Houston, TX, USA in collaboration with:

J. Craig Venter Science Foundation Joint Technology Center, Rockville, MD, USA
The Genome Institute at WUSTL, St. Louis, MO, USA

BAC resources: Children's Hospital Oakland Research Institute (CHORI), Oakland, CA, USA

BAC-based fingerprint map: Genome Sciences Centre, Vancouver, B.C.

For more information on the MMGSC rhesus macaque genome project, see the BCM HGSC Rhesus Monkey Genome Project web page.

UCSC Rhesus Genome Browser/Annotations:

rheMac10: Hiram Clawson and Lou Nassar
rheMac8: Hiram Clawson, Christopher Lee, and Cath Tyner
rheMac3: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik
rheMac2: Robert Baertsch, Kayla Smith, Ann Zweig, Robert Kuhn, and Donna Karolchik
rheMac1 (archived): Robert Baertsch, Galt Barber, and Donna Karolchik

Data use

The Macaca mulatta Mmul_9.0 and Mmul_10 assembly sequences are made available to the community by the McDonnell Genome Institute, Washington University, St Louis, MO with the following understanding: Users are free to analyze these data if the providers of this data are properly cited, any whole genome analyses should follow guidelines outlined in the Toronto agreement on the use of this assembly.

The initial sequencing and analysis of the rhesus macaque genome produced by the MMGSC was published in: Rhesus Macaque Genome Sequencing and Analysis Consortium. Evolutionary and Biomedical Insights from the Rhesus Macaque Genome. Science. 2007 Apr 13;316(5822):222-34. PMID: 17431167

The initial sequencing and analysis of the rhesus macaque genome produced by BGI was published in: Yan G, Zhang G, Fang X, Zhang Y, Li C, Ling F, Cooper DN, Li Q, Li Y, van Gool AJ et al Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques. Nat Biotechnol. 2011 Oct 16;29(11):1019-23. PMID: 22002653

The rhesus macaque sequence is made freely available for public use with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged. For the rheMac1 and rheMac2 data, please cite the BCM-HGSC web site or publications from BCM-HGSC referring to the genome sequence.
The BCM-HGSC and RMGSC plan to publish the assembly and genomic annotation of the dataset, including large-scale identification of regions of evolutionary conservation.
This is in accordance with, and with the understandings in the Fort Lauderdale meeting discussing Community Resource Projects and the resulting NHGRI policy statement.
Any redistribution of the data should carry this notice.

Rock hyrax genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly:

Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC), Houston, TX, USA
The Broad Institute, Cambridge, MA, USA

UCSC Rock Hyrax Genome Browser/Annotations:

proCap1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The rock hyrax sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The rock hyrax data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Sheep genome

Sequencing/Assembly: International Sheep Genomics Consortium (ISGC)

Mitochondrial Genome Sequencing: Justus-Liebig-University, Germany

UCSC Sheep Genome Browser/Annotations:

oviAri4: Hiram Clawson, Chris Lee, Daniel Schmelter
oviAri3: Hiram Clawson, Brian Raney, Matthew Speir, Brian Lee, Brooke Rhead, Steve Heitner, and Donna Karolchik
oviAri1: Chin Li, Greg Roe, and Luvina Guruvadoo

Data use

The sheep genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite these publications when using these data:

Jiang Y, Xie M, Chen W, Talbot R, Maddox JF, Faraut T, Wu C, Muzny DM, Li Y, Zhang W et al. The sheep genome illuminates biology of the rumen and lipid metabolism. Science. 2014 Jun 6;344(6188):1168-73. PMID: 24904168

International Sheep Genomics Consortium. The sheep genome reference sequence: a work in progress. Anim Genet. 2010 Oct;41(5):449-53. PMID: 20809919

Shrew genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Shrew Genome Browser/Annotations:

sorAra2: Hiram Clawson, Brian Raney, and Steve Heitner
sorAra1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The shrew sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The shrew data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Sloth genome

Sequencing: The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA

Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Sloth Genome Browser/Annotations:

choHof1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The sloth sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The sloth data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Squirrel genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Squirrel Genome Browser/Annotations:

speTri2: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The squirrel sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The squirrel data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Squirrel monkey genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Squirrel Monkey Genome Browser/Annotations:

saiBol1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The squirrel monkey sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Tarsier genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly:

The Broad Institute, Cambridge, MA, USA
Washington University, St Louis, MO, USA

UCSC Tarsier Genome Browser/Annotations:

tarSyr2: Hiram Clawson, Brian Raney, and Matthew Speir
tarSyr1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The tarsier sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The tarsier data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Tasmanian devil genome

Sequencing/Assembly: Wellcome Trust Sanger Institute, Cambridge, UK

UCSC Tasmanian Devil Genome Browser/Annotations:

sarHar1: Chin Li and Luvina Guruvadoo

Data use

The Tasmanian devil genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Murchison EP, Schulz-Trieglaff OB, Ning Z, Alexandrov LB, Bauer MJ, Fu B, Hims M, Ding Z, Ivakhno S, Stewart C et al. Genome sequencing and analysis of the Tasmanian devil and its transmissible cancer. Cell. 2012 Feb 17;148(4):780-91. PMID: 22341448

Tenrec genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Tenrec Genome Browser/Annotations:

echTel2: Hiram Clawson, Brian Raney and Steve Heitner
echTel1: Hiram Clawson, Brian Raney and Steve Heitner

Data use

The tenrec sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Tree shrew genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Tree Shrew Genome Browser/Annotations:

tupBel1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The tree shrew sequence has been made freely available as part of the Mammalian Genome Project (29 Mammals Project). The initial analysis of this dataset can be found in Lindblad-Toh K et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011 Oct 12;478(7370):476-82. PMID: 21993624; PMCID: PMC3207357

The tree shrew data are provided with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
Any redistribution of the data should carry this notice.

Wallaby genome

Sequencing/Assembly: The Tammar Wallaby Genome Sequencing Consortium - Baylor College of Medicine Human Genome Sequencing Center (BCM-HGSC) and the Australian Genome Research Facility

UCSC Wallaby Genome Browser/Annotations:

macEug2: Chin Li and Luvina Guruvadoo

Data use

The wallaby genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Renfree MB, Papenfuss AT, Deakin JE, Lindsay J, Heider T, Belov K, Rens W, Waters PD, Pharo EA, Shaw G et al. Genome sequence of an Australian kangaroo, Macropus eugenii, provides insight into the evolution of mammalian reproduction and development. Genome Biol. 2011 Aug 29;12(8):R81. PMID: 21854559

White rhinoceros genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

Data Samples: Dr. Oliver Ryder, San Diego Zoo Institute for Conservation Research, San Diego, CA

UCSC White Rhinoceros Genome Browser/Annotations:

cerSim1: Hiram Clawson and Steve Heitner

Data use

The southern white rhinoceros sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

African clawed frog

Sequencing/Assembly: International Xenopus Sequencing Consortium

UCSC African clawed frog Genome Browser/Annotations:

xenLae2: Hiram Clawson and Cath Tyner

Data use

The African clawed frog genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Session AM, Uno Y, Kwon T, Chapman JA, Toyoda A, Takahashi S, Fukui A, Hikosaka A, Suzuki A, Kondo M et al. Genome evolution in the allotetraploid frog Xenopus laevis. Nature. 2016 Oct 20;538(7625):336-343. PMID: 27762356; PMC: PMC5313049

American alligator genome

Sequencing/Assembly: International Crocodilian Genomes Working Group

UCSC American Alligator Genome Browser/Annotations:

allMis1: Hiram Clawson, Jonathan Casper, and Brian Lee

Data use

The American alligator genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

St John JA, Braun EL, Isberg SR, Miles LG, Chong AY, Gongora J, Dalzell P, Moran C, Bed'hom B, Abzhanov A et al. Sequencing three crocodilian genomes to illuminate the evolution of archosaurs and amniotes. Genome Biol. 2012 Jan 31;13(1):415. PMID: 22293439

Atlantic cod genome

Funding: Research Council of Norway (FUGE)

Sequencing/Assembly: GenoFisk Consortium

UCSC Atlantic Cod Genome Browser/Annotations:

gadMor1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The Atlantic cod genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite these publications when using these data:

Johansen SD, Coucheron DH, Andreassen M, Karlsen BO, Furmanek T, Jørgensen TE, Emblem A, Breines R, Nordeide JT, Moum T et al. Large-scale sequence analyses of Atlantic cod. N Biotechnol. 2009 Jun;25(5):263-71. PMID: 19491044

Star B, Nederbragt AJ, Jentoft S, Grimholt U, Malmstrøm M, Gregers TF, Rounge TB, Paulsen J, Solbakken MH, Sharma A et al. The genome sequence of Atlantic cod reveals a unique immune system. Nature. 2011 Aug 10;477(7363):207-10. PMID: 21832995

Budgerigar genome

Sequencing/Assembly: The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA

UCSC Budgerigar Genome Browser/Annotations:

melUnd1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The budgerigar sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Chicken genome

Sequencing: The Genome Reference Consortium

The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA

Physical Map: The Genome Institute at WUSTL

Genetic Mapping/Linkage Analysis: The Chicken Mapping Consortium:

Wageningen Map based on Wageningen broilerxbroiler mapping population ("W" linkage maps; 460 F2 animals), coordinated by Martien Groenen, Wageningen University, Wageningen, The Netherlands
East Lansing Map based on the RJFxlayer cross ("E" linkage maps; 56 BC animals), coordinated by Hans Cheng, USDA-ARS, East Lansing, MI, USA
Compton map based on layerxlayer cross ("C" linkage maps; 52 BC animals), coordinated by Nat Bumstead, Institute for Animal Health
Consensus linkage map: Martien Groenen and Richard Crooijmans, Wageningen University
RH map: coordinated by Mireille Morisson and Alain Vignal, French National Institute for Agricultural Research (INRA), Toulouse
Linkage Mapping: White Leghorn X Red Junglefowl (800 F2 animals), coordinated by Susanne Kerje, Lina Jacobsson and Leif Andersson, Uppsala University, Uppsala, Sweden
Clone/Marker Pairs: coordinated by Jerry Dodgson, Michigan State University, East Lansing, MI, USA
Marker Name/Sequence Resolution: coordinated by Hans Cheng, USDA-ARS, East Lansing, MI, USA
Chicken FPC browser: ChickFPC, Martien Groenen and Richard Crooijmans (galGal3), Jan Aerts (galGal2), Wageningen University
Additional Mapping Data Contributions: Winston Bellott, David Page Lab, Whitehead Institute for Biomedical Research (WIBR-MIT), Cambridge, MA, USA

cDNA sequences: National Institutes of Health and The University of Manchester BBSRC ChickEST Database

RJF finished clones:

NIH Intramural Sequencing Center (NISC) (AC091708.2, AC094012.2, AC138565.3)
S. Kerje and L. Andersson, Uppsala University (AY636124.1)
The Genome Institute at WUSTL (all others)

Assembly, Assembly/Map Integration, Golden Path Creation: The Genome Institute at WUSTL

SNPs and cross-referenced gene annotations: The Beijing Genomics Institute (BGI), Beijing, China

UCSC Chicken Genome Browser/Annotations:

galGal6: Hiram Clawson, Daniel Schmelter
galGal5: Hiram Clawson, Brian Raney and Matthew Speir
galGal4: Hiram Clawson, Brian Raney and Steve Heitner
galGal3: Angie Hinrichs, Kayla Smith, Donna Karolchik
galGal2: Angie Hinrichs, Heather Trumbower, Rachel Harte, Donna Karolchik

Data use

The chicken genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

International Chicken Genome Sequencing Consortium. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004 Dec 9;432(7018):695-716. PMID: 15592404

The BGI gene annotations and SNP data are made available with the following terms of use:

Users are free to use these data in scientific papers analyzing particular genes, if the Beijing Genomics Institute (BGI) is acknowledged.
BGI and its collaborators reserve the right to publish the initial analyses of these data, including but not restricted to the large-scale identification of functional polymorphisms, evolutionary patterns and signs of selection, correlations with known QTLs, utility of data for genetic mapping purposes, etc.
Any redistribution of these data should carry this notice.

Coelacanth genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Coelacanth Genome Browser/Annotations:

latCha1: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The coelacanth genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Amemiya CT, Alföldi J, Lee AP, Fan S, Philippe H, Maccallum I, Braasch I, Manousaki T, Schneider I, Rohner N et al. The African coelacanth genome provides insights into tetrapod evolution. Nature. 2013 Apr 18;496(7445):311-6. PMID: 23598338

Elephant shark genome

Sequencing/Assembly: Singapore Institute of Molecular and Cell Biology

UCSC Elephant Shark Genome Browser/Annotations:

calMil1: Hiram Clawson, Jonathan Casper, and Donna Karolchik

Data use

The elephant shark genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Venkatesh B, Lee AP, Ravi V, Maurya AK, Lian MM, Swann JB, Ohta Y, Flajnik MF, Sutoh Y, Kasahara M et al. Elephant shark genome provides unique insights into gnathostome evolution. Nature. 2014 Jan 9;505(7482):174-9. PMID: 24402279

Fugu genome

The Fugu genome assembly (Takifugu rubripes) is provided by the Joint Genome Institute (JGI) as part of the International Fugu Genome Consortium, led by the JGI and the Singapore Institute of Molecular and Cell Biology (IMCB).

Sequencing/Assembly: Assembly was constructed with the JGI assembler, JAZZ, from paired end sequencing reads produced by JGI and the IMCB at JGI, Myriad Genetics, and Celera Genomics

UCSC Fugu Genome Browser/Annotations:

fr3: Hiram Clawson and Greg Roe
fr2: Cory McClean, Hiram Clawson, Ann Zweig, and Donna Karolchik
fr1: Kate Rosenbloom, Heather Trumbower, Robert Kuhn, and Donna Karolchik

Data use

The Fugu genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Kai W, Kikuchi K, Tohari S, Chew AK, Tay A, Fujiwara A, Hosoya S, Suetake H, Naruse K, Brenner S et al. Integration of the genetic map and genome assembly of fugu facilitates insights into distinct features of genome evolution in teleosts and mammals. Genome Biol Evol. 2011;3:424-42. PMID: 21551351

Lamprey genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly:

University of Kentucky, Lexington, KY, USA
The Genome Institute at WUSTL, St. Louis, MO, USA
Jeramiah Smith (University of Kentucky, University of Washington Department of Genome Sciences, and Benaroya Research Institute at Virginia Mason)

UCSC Lamprey Genome Browser/Annotations:

petMar3: Hiram Clawson and Jairo Navarro
petMar2: Hiram Clawson, Brian Raney, and Steve Heitner
petMar1: Hiram Clawson, Ann Zweig, Pauline Fujita, and Donna Karolchik

Data use

The lamprey genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Smith JJ, Kuraku S, Holt C, Sauka-Spengler T, Jiang N, Campbell MS, Yandell MD, Manousaki T, Meyer A et al. Sequencing of the sea lamprey (Petromyzon marinus) genome provides insights into vertebrate evolution. Nat Genet. 2013 Apr;45(4):415-21, 421e1-2. PMID: 234350859

Lizard genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA USA

UCSC Lizard Genome Browser/Annotations:

anoCar2: Hiram Clawson, Brian Raney, Luvina Guruvadoo, and Donna Karolchik
anoCar1: Hiram Clawson, Archana Thakkapallayil, Robert Kuhn, and Donna Karolchik

Data use

The lizard genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Alföldi J, Di Palma F, Grabherr M, Williams C, Kong L, Mauceli E, Russell P, Lowe CB, Glor RE, Jaffe JD et al. The genome of the green anole lizard and a comparative analysis with birds and mammals. Nature. 2011 Aug 31;477(7366):587-91. PMID: 21881562

Medaka genome

The medaka genome assembly (Oryzias latipes) is provided by Japan's National Institute of Genetics (NIG) and the University of Tokyo. The assembly was constructed using the RAMEN assembler.

NOTE: The UCSC oryLat1 assembly was replaced by oryLat2 in Nov. 2008 to correct a UCSC assembly error with chrUn in which the gap relationships between contigs within their ultracontigs were broken. The oryLat2 assembly is based on the same assembly as oryLat1: the v1.0 assembly from NIG and the University of Tokyo.

Sequencing/Assembly: NIG and University of Tokyo, Japan

UCSC Medaka Genome Browser/Annotations:

oryLat2: Hiram Clawson, Pauline Fujita, and Donna Karolchik

Data use

The medaka genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Kasahara M, Naruse K, Sasaki S, Nakatani Y, Qu W, Ahsan B, Yamada T, Nagayasu Y, Doi K, Kasai Y et al. The medaka draft genome and insights into vertebrate genome evolution. Nature. 2007 Jun 7;447(7145):714-9. PMID: 17554307

Medium ground finch genome

Sequencing/Assembly: Genome 10K Project and Beijing Genomics Institute (BGI)

UCSC Medium Ground Finch Genome Browser/Annotations:

geoFor1: Hiram Clawson and Greg Roe

Data use

The medium ground finch genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Zhang G, Li C, Li Q, Li B, Larkin DM, Lee C5, Storz JF, Antunes A, Greenwold MJ, Meredith RW et al. Comparative genomics reveals insights into avian genome evolution and adaptation. Science. 2014 Dec 12;346(6215):1311-20. PMID: 25504712; PMCID: PMC4390078

Nile tilapia genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Nile Tilapia Genome Browser/Annotations:

oreNil2: Hiram Clawson, Chin Li, Brian Raney, Pauline Fujita, Luvina Guruvadoo, Steve Heitner, Brooke Rhead, Greg Roe, and Donna Karolchik

Data use

The Nile tilapia genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Brawand D, Wagner CE, Li YI, Malinsky M, Keller I, Fan S, Simakov O, Ng AY, Lim ZW, Bezault E et al. The genomic substrate for adaptive radiation in African cichlid fish. Nature. 2014 Sep 18;513(7518):375-81. PMID: 25186727; PMCID: PMC4353498

Painted turtle genome

Sequencing/Assembly: International Painted Turtle Genome Sequencing Consortium

UCSC Painted Turtle Genome Browser/Annotations:

chrPic1: Hiram Clawson, Brian Raney and Steve Heitner

Data use

The painted turtle genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Badenhorst D, Hillier LW, Literman R, Montiel EE, Radhakrishnan S, Shen Y, Minx P, Janes DE, Warren WC, Edwards SV et al. Physical mapping and refinement of the painted turtle genome (Chrysemys picta) inform amniote genome evolution and challenge turtle-bird chromosomal conservation. Genome Biol Evol. 2015 Jun 24;7(7):2038-50. PMID: 26108489; PMCID: PMC4524486

Stickleback genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA USA

UCSC Stickleback Genome Browser/Annotations:

gasAcu1: Angie Hinrichs, Hiram Clawson, Kayla Smith, and Donna Karolchik

Data use

The stickleback sequence is made freely available before scientific publication with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Tetraodon genome

Sequencing/Assembly:

Genoscope, Evry, France
The Broad Institute, MA, USA

Project Funding:

Consortium National de Recherche en Genomique, France
National Human Genome Research Institute (NHGRI), USA

Initial Genome Browser Annotations:

Genoscope - Genscan gene predictions, GAZE gene predictions
UCSC Genome Bioinformatics Group, University of California, Santa Cruz, CA, USA - remaining annotations (based on data from Genoscope)

UCSC Tetraodon Genome Browser:

tetNig2: Hiram Clawson, Mark Diekhans, Brian Raney, Ann Zweig, Mary Goldman, Vanessa Swing, Antonio Coelho, Robert Kuhn, and Donna Karolchik
tetNig1: Rachel Harte, Robert Kuhn, Donna Karolchik

Data use

The Tetraodon genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Yue GH, Lo LC, Zhu ZY, Lin G, Feng F. The complete nucleotide sequence of the mitochondrial genome of Tetraodon nigroviridis. DNA Seq. 2006 Apr;17(2):115-21. PMID: 17076253

Tibetan frog genome

Sequencing/Assembly: The Beijing Genomics Institute, Shenzhen, China

UCSC Tibetan Frog Genome Browser/Annotations:

nanPar1: Hiram Clawson and Jairo Navarro

Data use

The Tibetan frog genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Sun YB, Xiong ZJ, Xiang XY, Liu SP, Zhou WW, Tu XL, Zhong L, Wang L, Wu DD, Zhang BL et al. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes. Proc Natl Acad Sci U S A. 2015 Mar 17;112(11):E1257-62. PMID: 25733869; PMCID: PMC4371989

Turkey genome

Sequencing/Assembly: The Turkey Genome Consortium - Virginia Tech College of Agriculture and Life Sciences and Virginia Bioinformatics Institute, Blacksburg, VA, USA

UCSC Turkey Genome Browser/Annotations:

melGal1: Chin Li, Mark Diekhans, Brian Raney, Brooke Rhead, Greg Roe, Steve Heitner, and Donna Karolchik
melGal5: Hiram Clawson and Chris Villarreal

Data use

The turkey genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Dalloul RA, Long JA, Zimin AV, Aslam L, Beal K, Blomberg Le Ann, Bouffard P, Burt DW, Crasta O, Crooijmans RP et al. Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo): genome assembly and analysis. PLoS Biol. 2010 Sep 7;8(9). PMID: 20838655

X. tropicalis genome

xenTro10

Sequencing/Assembly: University of California, Berkeley, Berkeley, CA, USA

UCSC X. tropicalis Genome Browser/Annotations:

xenTro10: Hiram Clawson, Lou Nassar, and Gerardo Perez

xenTro9 and older

Sequencing/Assembly: Joint Genome Institute (JGI), Walnut Creek, CA, USA

UCSC X. tropicalis Genome Browser/Annotations:

xenTro9: Hiram Clawson and Christopher Lee
xenTro7: Hiram Clawson, Brian Raney, and Matthew Speir
xenTro3: Hiram Clawson, Brian Raney, Brooke Rhead, Greg Roe, Steve Heitner, and Donna Karolchik
xenTro2: Angie Hinrichs, Kayla Smith, Robert Kuhn, and Donna Karolchik
xenTro1: Fan Hsu, Jim Kent, Heather Trumbower, Hiram Clawson, Brian Raney, Galt Barber, Mark Diekhans, Angie Hinrichs, and Donna Karolchik

Data use

The X. tropicalis genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Hellsten U, Harland RM, Gilchrist MJ, Hendrix D, Jurka J, Kapitonov V, Ovcharenko I, Putnam NH, Shu S, Taher L et al. The genome of the Western clawed frog Xenopus tropicalis. Science. 2010 Apr 30;328(5978):633-6. PMID: 20431018

Zebra finch genome

Sequencing/Assembly: The Genome Institute at WUSTL, St. Louis, MO, USA

Zebra finch DNA: Arthur P. Arnold lab, Dept. of Physiological Science, UCLA, Los Angeles, CA, USA

UCSC Zebra Finch Genome Browser/Annotation:

taeGut2: Hiram Clawson, Brian Raney, and Steve Heitner
taeGut1: Brian Raney, Kayla Smith, Pauline Fujita, and Donna Karolchik

Data use

The zebra finch genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Warren WC, Clayton DF, Ellegren H, Arnold AP, Hillier LW, Künstner A, Searle S, White S, Vilella AJ, Fairley S et al. The genome of a songbird. Nature. 2010 Apr 1;464(7289):757-62. PMID: 20360741

Zebrafish genome

Starting with the danRer10 (GRCz10) assembly, the zebrafish genome sequence is now provided by the Genome Reference Consortium (GRC). See the GRC Zebrafish Genome web page for acknowledgments.

Most Recent UCSC Zebrafish Genome Browser (danRer11): Christopher Eisenhart, Hiram Clawson, Christopher Lee

Prior assemblies have the following acknowledgments:

Sequencing/Assembly: The Wellcome Trust Sanger Institute, UK.

FPC map:

The Wellcome Trust Sanger Institute, UK
Hubrecht Laboratory, Netherlands Institute for Developmental Biology, The Netherlands
Max-Planck Institute for Developmental Biology, Germany

BAC Clones, BAC End Pairs:

The Wellcome Trust Sanger Institute, UK - GenBank accessions for BAC clones
Max-Planck Institute for Developmental Biology, Germany - BAC end pairs sequence data
Hubrecht Laboratory, Netherlands Institute for Developmental Biology, The Netherlands - BAC end pairs sequence data

Annotations:

Yi Zhou, Anthony DiBiase, Leonard Zon and the Zebrafish Genome Initiative at Boston Children's Hospital, Boston, MA, US - collaboration on Zebrafish WZ EST clusters, BAC End Singles/Pairs, Radiation Hybrid, and Human tBLASTn annotation tracks
Washington University in St. Louis, St. Louis, MO, US - Zebrafish WZ EST clusters data
Ensembl, UK - Ensembl gene predictions
UCSC Genome Bioinformatics Group, University of California, Santa Cruz, CA, USA - Other annotations

UCSC Zebrafish Genome Browsers:

danRer11: Christopher Eisenhart, Hiram Clawson, Christopher Lee
danRer10: Hiram Clawson, Brian Raney, Steve Heitner
danRer7: Hiram Clawson, Mary Goldman, Greg Roe, Brian Raney, Donna Karolchik
danRer6: Galt Barber, Robert Kuhn, Katrina Learned, Donna Karolchik
danRer5: Rachel Harte, Ann Zweig, Donna Karolchik
danRer4: Rachel Harte, Archana Thakkapallayil, Robert Kuhn, Ann Zweig, Donna Karolchik
danRer3: Rachel Harte, Jennifer Jackson, Ann Zweig, Ali Sultan-Qurraie, Donna Karolchik
danRer2 (archived): Rachel Harte, Mark Diekhans, Heather Trumbower, Donna Karolchik, Jennifer Jackson

Data use

The zebrafish genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Howe K, Clark MD, Torroja CF, Torrance J, Berthelot C, Muffato M, Collins JE, Humphray S, McLaren K, Matthews L et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature. 2013 Apr 25;496(7446):498-503. PMID: 23594743

For more information about zebrafish assemblies provided by the GRC, see the GRC zebrafish web pages. For information about assemblies prior to danRer10/GRCz10, see the Sanger Institute's Zebrafish Sequencing Project web page.

C. intestinalis genome

Project Funding: Work performed under the auspices of the US Department of Energy's Office of Science, Biological and Environmental Research Program and the by the University of California, Lawrence Livermore National Laboratory under Contract No. W-7405-Eng-48, Lawrence Berkeley National Laboratory under contract No. DE-AC03-76SF00098 and Los Alamos National Laboratory under contract No. W-7405-ENG-36.

Project Collaborators:

Joint Genome Institute (JGI), Walnut Creek, CA, USA
Department of Zoology, Graduate School of Science, Kyoto University, Kyoto, Japan
Department of Molecular and Cellular Biology, University of California, Berkeley, CA, USA
National Institute of Genetics, Mishima, Japan
Oak Ridge National Laboratory, Oak Ridge, TN, USA
Los Alamos National Laboratory, Los Alamos, NM, USA

Annotations:

Colin Dewey, University of California, Berkeley, CA, USA - SNAP Gene Predictions, using the SNAP parser written by Ian Korf
JGI, Walnut Creek, CA, USA - JGI Gene Predictions
UCSC Genome Bioinformatics Group, University of California, Santa Cruz, CA, USA - Other annotations

UCSC C. intestinalis Genome Browsers:

ci3: Hiram Clawson and Chris Villarreal
ci2: Brian Raney, Mark Diekhans, Ann Zweig, Kayla Smith, Robert Kuhn, Donna Karolchik
ci1: Brian Raney, Galt Barber, Heather Trumbower, Robert Kuhn, Donna Karolchik

Data use

The C. intestinalis sequence data have been freely provided by the JGI and may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Dehal P, Satou Y, Campbell RK, Chapman J, Degnan B, De Tomaso A, Davidson B, Di Gregorio A, Gelpke M, Goodstein DM et al. The draft genome of Ciona intestinalis: insights into chordate and vertebrate origins. Science. 2002 Dec 13;298(5601):2157-67. PMID: 12481130

For more information about the C. intestinalis assembly, see the JGI C. intestinalis portal.

Lancelet genome

Funding: U.S. Department of Energy Office of Science, Biological and Environmental Research Program

Sequencing/Assembly: JGI

UCSC Lancelet Genome Browser/Annotations:

braFlo1: Hiram Clawson, Ann Zweig, Pauline Fujita, Donna Karolchik

Data use

The lancelet genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Putnam NH, Butts T, Ferrier DE, Furlong RF, Hellsten U, Kawashima T, Robinson-Rechavi M, Shoguchi E, Terry A, Yu JK et al. The amphioxus genome and the evolution of the chordate karyotype. Nature. 2008 Jun 19;453(7198):1064-71. PMID: 18563158

S. purpuratus genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing/Assembly: BCM-HGSC, Houston, TX, USA

UCSC S. purpuratus Genome Browsers/Annotations:

strPur2: Kord Kober, Heather Trumbower, Rachel Harte, Kayla Smith, Donna Karolchik
strPur1: Andy Pohl, Kayla Smith, Donna Karolchik

Data use

The sea urchin genome data may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite this publication when using these data:

Sea Urchin Genome Sequencing Consortium. The genome of the sea urchin Strongylocentrotus purpuratus. Science. 2006 Nov 10;314(5801):941-52. PMID: 17095691

For more information on the sea urchin genome project, see the BCM-HGSC Sea Urchin Genome Project web page.

A. gambiae genome

The MOZ2 Anopheles gambiae data were provided by the International Anopheles Genome Project and downloaded from Ensembl. The data have the following acknowledgments:

Funding:

Celera Genomics and Genoscope were funded respectively by National Institutes of Health (NIH) and the French Ministry of Research.

Project collaboration: The International Anopheles Genome Project - a collaboration between:

Sequencing and Assembly:

Data Downloads: Ensembl

UCSC A. gambaie Genome Browser/Annotations:

anoGam1: Angie Hinrichs, Galt Barber, Donna Karolchik
anoGam3: Hiram Clawson, Luis Nassar

Additional Genome Browser Annotations:

Ensembl
Celera Genomics (Ensembl Genes)

Data use

The A. gambaie sequence is made freely available by the International Anopheles Genome Project. Please cite the following publication when using these data:

Holt RA, Subramanian GM, Halpern A, Sutton GG, Charlab R, Nusskern DR, Wincker P, Clark AG, Ribeiro JM, Wides R et al. The genome sequence of the malaria mosquito Anopheles gambiae. Science. 2002 Oct 4;298(5591):129-49. PMID: 12364791

A. mellifera genome

Funding:

National Human Genome Research Institute (NHGRI)
U.S. Department of Agriculture (USDA)

Sequencing and assembly: BCM-HGSC, Houston, TX, USA

BAC library: Pieter de Jong and Katzutoyo Osoegawa at Children's Hospital Oakland Research Institute (CHORI), Oakland, CA, USA

UCSC A. mellifera Genome Browser/Annotations:

apiMel2: Andy Pohl, Angie Hinrichs, Jennifer Jackson, Donna Karolchik
apiMel1: Angie Hinrichs, Brian Raney, Robert Kuhn, Donna Karolchik

Data use

The A. mellifera sequence is made freely available by the Honeybee Genome Sequencing Consortium. Please cite the following publication when using these data:

Honeybee Genome Sequencing Consortium. Insights into social insects from the genome of the honeybee Apis mellifera. Nature. 2006 Oct 26;443(7114):931-49. PMID: 17073008

See the Baylor conditions of use statement for guidelines regarding the use of these data.

D. ananassae genome

The droAna2 data were provided by Agencourt Bioscience. The droAna1 data were produced by the Institute for Genomic Research (TIGR).

Sequencing: Agencourt Bioscience, Danvers, MA, USA

Assembly:

Agencourt Bioscience, Danvers, MA, USA, using the Arachne assembler (droAna2)
TIGR, Rockville, MD, USA, using an assembler produced by Celera (droAna1)

UCSC A. mellifera Genome Browser/Annotations:

droAna2: Angie Hinrichs, Brian Raney, Ann Zweig, Kayla Smith, and Donna Karolchik
droAna1: Angie Hinrichs, Brian Raney, Heather Trumbower, Ali Sultan-Qurraie, and Donna Karolchik

Data use

The D. ananassae sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite these publications when using these data:

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

Zimin AV, Smith DR, Sutton G, Yorke JA. Assembly reconciliation. Bioinformatics. 2008 Jan 1;24(1):42-5. PMID: 18057021

D. erecta genome

Sequencing and Assembly: Agencourt Bioscience, Danvers, MA, USA, using the Arachne assembler

UCSC D. erecta Genome Browser/Annotations:

droEre1: Angie Hinrichs, Brian Raney, Jennifer Jackson and Donna Karolchik

Data use

The D. erecta sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite these publications when using these data:

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

Zimin AV, Smith DR, Sutton G, Yorke JA. Assembly reconciliation. Bioinformatics. 2008 Jan 1;24(1):42-5. PMID: 18057021

D. grimshawi genome

Sequencing and Assembly: Agencourt Bioscience, Danvers, MA, USA, using the Arachne assembler

UCSC D. grimshawi Genome Browser/Annotations:

droGri1: Angie Hinrichs, Brian Raney, Jennifer Jackson, Kayla Smith and Donna Karolchik

Data use

The D. grimshawi sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite these publications when using these data:

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

Zimin AV, Smith DR, Sutton G, Yorke JA. Assembly reconciliation. Bioinformatics. 2008 Jan 1;24(1):42-5. PMID: 18057021

D. melanogaster genome

The D. melanogaster assemblies displayed in the UCSC Genome Browser were obtained from the Berkeley Drosophila Genome Project (BDGP) with the following acknowledgments:

Sequencing and Assembly:

Euchromatic sequence - BDGP, Berkeley, CA, USA
Heterochromatic sequence - Drosophila Heterochromatin Genome Project, Lawrence Berkeley National Laboratory, Berkeley, CA, USA

Annotations:

Euchromatic regions - FlyBase
Heterochromatic regions - DHGP, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
UCSC Genome Bioinformatics Group, UCSC, Santa Cruz, CA, USA

UCSC D. melanogaster Genome Browser:

dm6: Hiram Clawson, Angie Hinrichs, and Matthew Speir
dm3: Angie Hinrichs, Archana Thakkapallayil, Kayla Smith, and Donna Karolchik
dm2: Angie Hinrichs, Brian Raney, Galt Barber, and Donna Karolchik
dm1: Angie Hinrichs, Brian Raney, Heather Trumbower, Ali Sultan-Qurraie, and Donna Karolchik

Data use

For additional information about these data, including citation guidelines, see the BDGP web site.

D. mojavensis genome

Sequencing and Assembly: Agencourt Bioscience, Danvers, MA, USA, using the Arachne assembler

Funding: National Human Genome Research Institute (NHGRI), USA

UCSC D. mojavensis Genome Browser/Annotations:

droMoj2: Angie Hinrichs, Brian Raney, Jennifer Jackson and Donna Karolchik
droMoj1: Angie Hinrichs, Brian Raney, Galt Barber and Donna Karolchik

Data use

The D. mojavensis sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite these publications when using these data:

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

Zimin AV, Smith DR, Sutton G, Yorke JA. Assembly reconciliation. Bioinformatics. 2008 Jan 1;24(1):42-5. PMID: 18057021

D. persimilis genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC D. persimilis Genome Browser/Annotations:

droPer1: Angie Hinrichs, Kayla Smith, Robert Kuhn, Jennifer Jackson and Donna Karolchik

Data use

The D. persimilis sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite the following publications when using these data:

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

Zimin AV, Smith DR, Sutton G, Yorke JA. Assembly reconciliation. Bioinformatics. 2008 Jan 1;24(1):42-5. PMID: 18057021

D. pseudoobscura genome

Sequencing and Assembly: BCM-HGSC, Houston, TX, USA

UCSC D. pseudoobscura Genome Browser/Annotations:

dp3: Angie Hinrichs, Brian Raney, Jennifer Jackson and Donna Karolchik
dp2: Angie Hinrichs, Heather Trumbower, Robert Kuhn, Donna Karolchik

Data use

The D. pseudoobscura data are made available with specific conditions for use. Please cite the following publications:

Richards S, Liu Y, Bettencourt BR, Hradecky P, Letovsky S, Nielsen R, Thornton K, Hubisz MJ, Chen R, Meisel RP et al. Comparative genome sequencing of Drosophila pseudoobscura: chromosomal, gene, and cis-element evolution. Genome Res. 2005 Jan;15(1):1-18. PMID: 15632085; PMCID: PMC540289

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

D. sechellia genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA USA

UCSC D. sechellia Genome Browser/Annotations:

droSec1: Angie Hinrichs, Kayla Smith and Donna Karolchik

Data use

The D. sechellia sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite the following publications when using these data:

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

Zimin AV, Smith DR, Sutton G, Yorke JA. Assembly reconciliation. Bioinformatics. 2008 Jan 1;24(1):42-5. PMID: 18057021

D. simulans genome

Sequencing and assembly: The Genome Institute at Washington University (WUSTL), St. Louis, MO, USA

UCSC D. simulans Genome Browser/Annotations:

droSim1: Angie Hinrichs, Jennifer Jackson, Ali Sultan-Qurraie, Brian Raney and Donna Karolchik

Data use

The D. simulans sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite the following publications when using these data:

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

D. virilis genome

Sequencing and Assembly: Agencourt Bioscience, Danvers, MA, USA, using the Arachne assembler

Funding: National Human Genome Research Institute (NHGRI), USA

UCSC D. virilis Genome Browser/Annotations:

droVir2: Angie Hinrichs, Brian Raney, Jennifer Jackson and Donna Karolchik
droVir1: Angie Hinrichs, Brian Raney, Heather Trumbower, Robert Kuhn, Jennifer Jackson, and Donna Karolchik

Data use

The D. virilis sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite the following publications when using these data:

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

Zimin AV, Smith DR, Sutton G, Yorke JA. Assembly reconciliation. Bioinformatics. 2008 Jan 1;24(1):42-5. PMID: 18057021

D. yakuba genome

Funding: National Human Genome Research Institute (NHGRI)

Sequencing, assembly and AGP creation: The Genome Institute at Washington University (WUSTL)

Initial Sequencing Proposal, Project Collaboration: David J. Begun and Charles H. Langley, University of California, Davis, CA, USA

UCSC D. yakuba Genome Browser/Annotations:

droYak2: Angie Hinrichs, Jennifer Jackson and Donna Karolchik
droYak1: Angie Hinrichs, Michael Chalup and Donna Karolchik

Data use

The D. yakuba sequence may be freely downloaded, used in analyses, and repackaged in databases if the data provider is properly acknowledged. Please cite the following publications when using these data:

Drosophila 12 Genomes Consortium. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203-18. PMID: 17994087

Zimin AV, Smith DR, Sutton G, Yorke JA. Assembly reconciliation. Bioinformatics. 2008 Jan 1;24(1):42-5. PMID: 18057021

C. elegans genome

The C. elegans data was obtained from WormBase. We'd like to thank The Genome Institute at Washington University (WUSTL) and the Sanger Institute for their collaborative work on sequencing the C. elegans genome, and the WormBase consortium for providing access to the current C. elegans sequence.

Worm Genomes (other than C. elegans)

The latest versions of the C. brenneri, C. briggsae, C. japonica, C. remanei, and P. pacificus sequences were obtained from the Genome Sequencing Center at The Genome Institute at Washington University (WUSTL). The cb1 browser data were obtained from WormBase.

We'd like to thank WUSTL for providing the sequence data for these assemblies, as well as the Sanger Institute for their collaborative work in sequencing the initial C. briggsae cb1 genome. Thanks also to the WormBase consortium for providing access to the cb1 sequence.

Data use

The worm sequences are made freely available with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The centers producing the data reserve the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome.
This is in accordance with, and with the understandings in the Fort Lauderdale meeting discussing Community Resource Projects and the resulting NHGRI policy statement.
Any redistribution of the data should carry this notice.

Yeast genome

The April 2011 Sacchromyces cerevisiae genome assembly (sacCer3) is based on sequence dated April 2011 in the Sacchromyces Genome Database (SGD). This genome sequence was downloaded from NCBI (genbank/genomes/Eukaryotes/fungi/Saccharomy ces_cerevisiae/SacCer_Apr2011/). The SGD gene annotations were downloaded from the SGD downloads site. The S288C strain was used in this sequencing project.

The June 2008 Sacchromyces cerevisiae genome assembly (sacCer2) is based on SGD sequence dated June 2008, and was obtained from the site http://downloads.yeastgenome.org/sequence/genomic_sequence/chromosomes/fasta/. The S288C strain was used in this sequencing project.

The Oct. 2003 Sacchromyces cerevisiae genome assembly (sacCer1) is based on SGD sequence dated 1 Oct. 2003. The sequence, open reading frame (ORF), and gene annotations were downloaded from the site http://www.yeastgenome.org/download-data/sequence. The cellular localization and protein abundance data displayed in the UCSC Gene Sorter are taken from the Yeast GFP Fusion Localization Database.

UCSC Sacchromyces cerevisiae Genome Browser/Annotations:

sacCer3: Hiram Clawson, Greg Roe, Steve Heitner

Data use

See the SGD Systematic Sequencing Table for credit and contact information for each of the chromosomes in the assembly. We'd like to thank Stanford University, the SGD, the University of California San Francisco (UCSF), Washington University in St. Louis, and The Broad Institute for providing the data and annotations for these browsers.

California sea hare genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC California Sea Hare Genome Browser/Annotations:

aplCal1: Galt Barber, Brian Raney, Hiram Clawson, Katrina Learned and Donna Karolchik

Data use

The California sea hare sequence is made freely available with the following understanding:

The data may be freely downloaded, used in analyses, and repackaged in databases.
Users are free to use the data in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.
The center producing the data reserves the right to publish the initial large-scale analyses of the data set, including large-scale identification of regions of evolutionary conservation and large-scale genomic assembly. Large-scale refers to regions with size on the order of a chromosome (that is, 30 Mb or more).
Any redistribution of the data should carry this notice.

Ebola virus genome

Sequencing/Assembly: The Broad Institute, Cambridge, MA, USA

UCSC Ebola virus Genome Browser/Annotations:

eboVir3: Hiram Clawson, Brian Raney, Maximillian Haeussler, Angie Hirichs, Kate Rosenbloom, Chris Eisenhart, Pauline Fujita, Matthew Speir, Donna Karolchik, Ann Zweig, David Haussler, and Jim Kent

Other Acknowledgements: Thanks to Phillip Berman and his lab at UCSC for providing continued feedback on the Ebola virus Genome Browser, and to the Pardis Sabeti Lab at the Broad Institute for their guidance.

Thank you to the following institutions for proving additional annotations on the Ebola virus genome:

National Center for Biotechnology Information (NCBI)
Immune Epitope Database (IEDB)
UniProt
RCSB Protein Data Bank (PDB)

Data use

The Ebola virus sequence may be freely downloaded, used in analyses, and repackaged in databases. The data may be freely used in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.

Monkeypox virus genome

Sequencing/Assembly: n/a

UCSC Monkeypox virus Genome Browser/Annotations:

mpxvRivers: Hiram Clawson, Brian Raney, Jairo Navarro

Data use

The Monkeyopx virus sequence may be freely downloaded, used in analyses, and repackaged in databases. The data may be freely used in scientific papers analyzing particular genes and regions if the provider of these data is properly acknowledged.

Acknowledgment of early support

During the initial stage of this project, from January 2000 to June 2001, we received no direct federal funding to develop the working draft other than travel support. However, we gratefully acknowledge the support for our personnel provided by our basic bioinformatics research grants, especially DOE grant DE-FG03-99ER62849 (to develop gene-finding methods), but also NSF grant DBI-9809007, UC BIOTECH grant 99-11, a grant from the Sloan Foundation, the David and Lucille Packard Foundation, NIH grant GM-52848, and the Zahler grant that provided support for Jim Kent in the early stages of this work.

We are particularly thankful for the risks that Patrick Mantey (UCSC Dean of Engineering), James Gill (Vice Chancellor for Research), John Simpson (UCSC Provost), and M.R.C. Greenwood (UCSC Chancellor) were willing to take in advancing us the money in February 2000 needed to build the original 100-node LINUX cluster used in this work, and in the case of Dean Mantey, additional personnel support. Phokion Kolaitis, Chair of Computer Science, contributed as well.

We thank Compaq for providing an additional 100-CPU computational cluster in Cambridge, MA for use by the public human genome effort. We thank ILOG, Inc. (now part of IBM) for providing discounted licenses to its CPLEX linear programming software, which was essential for some of our early experiments.

In December 2000, HHMI began providing support for equipment and personnel. In the summer of 2001, HHMI provided the primary funding for a new cluster of 1000 CPUs running LINUX purchased from Rackable (now SGI), with additional support from CISI and NHGRI. Dubbed the UCSC KiloKluster, this equipment became the computational backbone for the initial years of the Genome Browser project.