TY - JOUR
T1 - Linkage disequilibrium network analysis (LDna) gives a global view of chromosomal inversions, local adaptation and geographic structure
AU - Kemppainen, Petri
AU - Knight, Christopher
AU - Sarma, Devojit
AU - Hlaing, Thaung
AU - Prakash, Anil
AU - Maung, Maung
AU - Naung, Yan
AU - Somboon, Pradya
AU - Mahanta, Jagadish
AU - Walton, Catherine
N1 - This work was supported by grants 089229/Z/09/Z and 097820/Z/11/A from the Wellcome Trust.
PY - 2015
Y1 - 2015
N2 - Recent advances in sequencing allow population-genomic data to be generated for virtually any species. However, approaches to analyze such data lag behind the ability to generate it, particularly in non-model species. Linkage disequilibrium (LD, the non-random association of alleles from different loci) is a highly sensitive indicator of many evolutionary phenomena including chromosomal inversions, local adaptation and geographical structure. Here we present linkage disequilibrium network analysis (LDna), which accesses information on LD shared between multiple loci genome-wide. In LD networks, vertices represent loci and connections between vertices represent the LD between them. We analyzed such networks in two test cases: a new Restriction-site Associated DNA sequence (RAD-seq) dataset for Anopheles baimaii, a Southeast Asian malaria vector; and a well-characterised single nucleotide polymorphism (SNP) dataset from 21 three-spined stickleback individuals. In each case we readily identified five distinct LD network clusters (single outlier clusters, SOCs), each comprising many loci connected by high LD. In A. baimaii, further population-genetic analyses supported the inference that each SOC corresponds to a large inversion, consistent with previous cytological studies. For sticklebacks, we inferred that each SOC was associated with a distinct evolutionary phenomenon: two chromosomal inversions, local adaptation, population-demographic history and geographic structure. LDna is thus a useful exploratory tool, able to give a global overview of LD associated with diverse evolutionary phenomena and identify loci potentially involved. LDna does not require a linkage map or reference genome, so is applicable to any population-genomic dataset, making it especially valuable for non-model species.
AB - Recent advances in sequencing allow population-genomic data to be generated for virtually any species. However, approaches to analyze such data lag behind the ability to generate it, particularly in non-model species. Linkage disequilibrium (LD, the non-random association of alleles from different loci) is a highly sensitive indicator of many evolutionary phenomena including chromosomal inversions, local adaptation and geographical structure. Here we present linkage disequilibrium network analysis (LDna), which accesses information on LD shared between multiple loci genome-wide. In LD networks, vertices represent loci and connections between vertices represent the LD between them. We analyzed such networks in two test cases: a new Restriction-site Associated DNA sequence (RAD-seq) dataset for Anopheles baimaii, a Southeast Asian malaria vector; and a well-characterised single nucleotide polymorphism (SNP) dataset from 21 three-spined stickleback individuals. In each case we readily identified five distinct LD network clusters (single outlier clusters, SOCs), each comprising many loci connected by high LD. In A. baimaii, further population-genetic analyses supported the inference that each SOC corresponds to a large inversion, consistent with previous cytological studies. For sticklebacks, we inferred that each SOC was associated with a distinct evolutionary phenomenon: two chromosomal inversions, local adaptation, population-demographic history and geographic structure. LDna is thus a useful exploratory tool, able to give a global overview of LD associated with diverse evolutionary phenomena and identify loci potentially involved. LDna does not require a linkage map or reference genome, so is applicable to any population-genomic dataset, making it especially valuable for non-model species.
KW - Landscape genomics
KW - Anopheles dirus
KW - Anopheles gambiae
KW - graph theory
KW - chromosomal rearrangement
KW - R package
U2 - 10.1111/1755-0998.12369
DO - 10.1111/1755-0998.12369
M3 - Article
SN - 1755-0998
JO - Molecular Ecology Resources
JF - Molecular Ecology Resources
ER -