%0 Journal Article %J PLoS Genet %D 2020 %T Copy number variants and fixed duplications among 198 rhesus macaques (Macaca mulatta). %A Brasó-Vives, Marina %A Povolotskaya, Inna S %A Hartasánchez, Diego A %A Farré, Xavier %A Fernandez-Callejo, Marcos %A Raveendran, Muthuswamy %A Harris, R Alan %A Rosene, Douglas L %A Lorente-Galdos, Belen %A Navarro, Arcadi %A Marques-Bonet, Tomas %A Rogers, Jeffrey %A Juan, David %K Animals %K Chromosome Mapping %K DNA Copy Number Variations %K Female %K Gene Duplication %K Genetics, Population %K Genome %K High-Throughput Nucleotide Sequencing %K Humans %K Macaca mulatta %K Male %K Open Reading Frames %K Phylogeny %K Sequence Analysis, DNA %K Species Specificity %X

The rhesus macaque is an abundant species of Old World monkeys and a valuable model organism for biomedical research due to its close phylogenetic relationship to humans. Copy number variation is one of the main sources of genomic diversity within and between species and a widely recognized cause of inter-individual differences in disease risk. However, copy number differences among rhesus macaques and between the human and macaque genomes, as well as the relevance of this diversity to research involving this nonhuman primate, remain understudied. Here we present a high-resolution map of sequence copy number for the rhesus macaque genome constructed from a dataset of 198 individuals. Our results show that about one-eighth of the rhesus macaque reference genome is composed of recently duplicated regions, either copy number variable regions or fixed duplications. Comparison with human genomic copy number maps based on previously published data shows that, despite overall similarities in the genome-wide distribution of these regions, there are specific differences at the chromosome level. Some of these create differences in the copy number profile between human disease genes and their rhesus macaque orthologs. Our results highlight the importance of addressing the number of copies of target genes in the design of experiments and cautions against human-centered assumptions in research conducted with model organisms. Overall, we present a genome-wide copy number map from a large sample of rhesus macaque individuals representing an important novel contribution concerning the evolution of copy number in primate genomes.

%B PLoS Genet %V 16 %P e1008742 %8 2020 05 %G eng %N 5 %1 https://www.ncbi.nlm.nih.gov/pubmed/32392208?dopt=Abstract %R 10.1371/journal.pgen.1008742 %0 Journal Article %J Nat Commun %D 2020 %T Discovery and population genomics of structural variation in a songbird genus. %A Weissensteiner, Matthias H %A Bunikis, Ignas %A Catalán, Ana %A Francoijs, Kees-Jan %A Knief, Ulrich %A Heim, Wieland %A Peona, Valentina %A Pophaly, Saurabh D %A Sedlazeck, Fritz J %A Suh, Alexander %A Warmuth, Vera M %A Wolf, Jochen B W %K Animals %K Chromosome Inversion %K Gene Deletion %K Genetic Variation %K Genetics, Population %K Genome %K Genomic Structural Variation %K Genotype %K Phylogeny %K Polymorphism, Single Nucleotide %K Retroelements %K Sequence Analysis, DNA %K Songbirds %X

Structural variation (SV) constitutes an important type of genetic mutations providing the raw material for evolution. Here, we uncover the genome-wide spectrum of intra- and interspecific SV segregating in natural populations of seven songbird species in the genus Corvus. Combining short-read (N = 127) and long-read re-sequencing (N = 31), as well as optical mapping (N = 16), we apply both assembly- and read mapping approaches to detect SV and characterize a total of 220,452 insertions, deletions and inversions. We exploit sampling across wide phylogenetic timescales to validate SV genotypes and assess the contribution of SV to evolutionary processes in an avian model of incipient speciation. We reveal an evolutionary young (~530,000 years) cis-acting 2.25-kb LTR retrotransposon insertion reducing expression of the NDP gene with consequences for premating isolation. Our results attest to the wealth and evolutionary significance of SV segregating in natural populations and highlight the need for reliable SV genotyping.

%B Nat Commun %V 11 %P 3403 %8 2020 07 07 %G eng %N 1 %1 https://www.ncbi.nlm.nih.gov/pubmed/32636372?dopt=Abstract %R 10.1038/s41467-020-17195-4 %0 Journal Article %J Nature %D 2020 %T Mapping and characterization of structural variation in 17,795 human genomes. %A Abel, Haley J %A Larson, David E %A Regier, Allison A %A Chiang, Colby %A Das, Indraniel %A Kanchi, Krishna L %A Layer, Ryan M %A Neale, Benjamin M %A Salerno, William J %A Reeves, Catherine %A Buyske, Steven %A Matise, Tara C %A Muzny, Donna M %A Zody, Michael C %A Lander, Eric S %A Dutcher, Susan K %A Stitziel, Nathan O %A Hall, Ira M %K Alleles %K Case-Control Studies %K Continental Population Groups %K Epigenesis, Genetic %K Female %K Gene Dosage %K Genetic Variation %K Genetics, Population %K Genome, Human %K High-Throughput Nucleotide Sequencing %K Humans %K Male %K Molecular Sequence Annotation %K Quantitative Trait Loci %K Software %K Whole Genome Sequencing %X

A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0-11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing.

%B Nature %V 583 %P 83-89 %8 2020 07 %G eng %N 7814 %1 https://www.ncbi.nlm.nih.gov/pubmed/32460305?dopt=Abstract %R 10.1038/s41586-020-2371-0 %0 Journal Article %J Nature %D 2020 %T A structural variation reference for medical and population genetics. %A Collins, Ryan L %A Brand, Harrison %A Karczewski, Konrad J %A Zhao, Xuefang %A Alföldi, Jessica %A Francioli, Laurent C %A Khera, Amit V %A Lowther, Chelsea %A Gauthier, Laura D %A Wang, Harold %A Watts, Nicholas A %A Solomonson, Matthew %A O'Donnell-Luria, Anne %A Baumann, Alexander %A Munshi, Ruchi %A Walker, Mark %A Whelan, Christopher W %A Huang, Yongqing %A Brookings, Ted %A Sharpe, Ted %A Stone, Matthew R %A Valkanas, Elise %A Fu, Jack %A Tiao, Grace %A Laricchia, Kristen M %A Ruano-Rubio, Valentin %A Stevens, Christine %A Gupta, Namrata %A Cusick, Caroline %A Margolin, Lauren %A Taylor, Kent D %A Lin, Henry J %A Rich, Stephen S %A Post, Wendy S %A Chen, Yii-Der Ida %A Rotter, Jerome I %A Nusbaum, Chad %A Philippakis, Anthony %A Lander, Eric %A Gabriel, Stacey %A Neale, Benjamin M %A Kathiresan, Sekar %A Daly, Mark J %A Banks, Eric %A MacArthur, Daniel G %A Talkowski, Michael E %K Continental Population Groups %K Disease %K Female %K Genetic Testing %K Genetic Variation %K Genetics, Medical %K Genetics, Population %K Genome, Human %K Genotyping Techniques %K Humans %K Male %K Middle Aged %K Mutation %K Polymorphism, Single Nucleotide %K Reference Standards %K Selection, Genetic %K Whole Genome Sequencing %X

Structural variants (SVs) rearrange large segments of DNA and can have profound consequences in evolution and human disease. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD) have become integral in the interpretation of single-nucleotide variants (SNVs). However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25-29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings. This SV resource is freely distributed via the gnomAD browser and will have broad utility in population genetics, disease-association studies, and diagnostic screening.

%B Nature %V 581 %P 444-451 %8 2020 05 %G eng %N 7809 %1 https://www.ncbi.nlm.nih.gov/pubmed/32461652?dopt=Abstract %R 10.1038/s41586-020-2287-8 %0 Journal Article %J Elife %D 2020 %T A variant-centric perspective on geographic patterns of human allele frequency variation. %A Biddanda, Arjun %A Rice, Daniel P %A Novembre, John %K Gene Frequency %K Genetic Variation %K Genetics, Population %K Geography %K Humans %X

A key challenge in human genetics is to understand the geographic distribution of human genetic variation. Often genetic variation is described by showing relationships among populations or individuals, drawing inferences over many variants. Here, we introduce an alternative representation of genetic variation that reveals the relative abundance of different allele frequency patterns. This approach allows viewers to easily see several features of human genetic structure: (1) most variants are rare and geographically localized, (2) variants that are common in a single geographic region are more likely to be shared across the globe than to be private to that region, and (3) where two individuals differ, it is most often due to variants that are found globally, regardless of whether the individuals are from the same region or different regions. Our variant-centric visualization clarifies the geographic patterns of human variation and can help address misconceptions about genetic differentiation among populations.

%B Elife %V 9 %8 2020 12 22 %G eng %1 https://www.ncbi.nlm.nih.gov/pubmed/33350384?dopt=Abstract %R 10.7554/eLife.60107