Stars
GEO-BON Genetics Working Group simulation based evaluations of Essential Biodiversity Variables (EBVs)
Lecture notes/book in progress on computing for conservation genomics
Where's my Heterozygotes at? Observations on genotyping Accuracy
TeraStructure is a new algorithm to fit Bayesian models of genetic variation in human populations on tera-sample-sized data sets (10^12 observed genotypes, i.e., 1M individuals at 1M SNPs). This pa…
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
mixture classification, constraint optimization, outlier detection, population structure, admixture history, and selection detection.
R package: future: Unified Parallel and Distributed Processing in R for Everyone
R package: parallel computing toolset for relatedness and principal component analysis of SNP data (Development version only)
strataG is a toolkit for haploid sequence and multilocus genetic data summaries, and analyses of population structure.
R package for inferring copy number from read depth
Source code for the program MavericK, described fully at www.bobverity.com/maverick
Tools for working with categorical variables (factors)
SCAT software for "Smoothed and Continuous Assignment Tests"
Simple and flexible manipulation of genomic data.
Assigner tutorial for Benestan et al. erratum