genotype1 {ClustMMDD} | R Documentation |
genotype1
is a data frame of genotype data with ploidy = 2.
A simulated data frame of genotype data with N = 1000
individuals genotyped at P = 10
loci.
Each string represents two alleles : ploidy = 2
. For example, "109107" represents ≤ft\{"109", "107"\right\}. The last column of the data frame contains integers that represent the population membership.
The format is: chr [1:1000, 1:10] "109107" "105101" "106106" ... and the 11 th column contains integers representing
the prior classification in 5
sub-populations.
Wilson Toussile
Dominique Bontemps and Wilson Toussile (2013) : Clustering and variable selection for categorical multivariate data. Electronic Journal of Statistics, Volume 7, 2344-2371, ISSN.
Wilson Toussile and Elisabeth Gassiat (2009) : Variable selection in model-based clustering using multilocus genotype data. Adv Data Anal Classif, Vol 3, number 2, 109-134.
data(genotype1) head(genotype1)