Rob Long

plantimals ·

tilt-shift st.louis

introduction I live in St.Louis, Missouri, where I am focused on industrial genomics (the organization of large-scale datasets for biotechnology), distributed systems, web3, RSS, paleontology, art, and pattern languages.

industrial genomics A blog post I co-wrote with Google Kubernetes Engineers on some of my work on Industrial Genomics: “Bayer Crop Science seeds the future with 15000-node GKE clusters (https://cloud.google.com/blog/products/containers-kubernetes/google-kubernetes-engine-clusters-can-have-up-to-15000-nodes)”. I gave a talk about my work at the Plant and Animal Genome XXVIII Conference (https://www.intlpag.org/2020/) titled: Industrializing Genotype Data on Public Cloud Infrastructure (https://intlpag.org/2020/images/pdf/2020/PAGXXVIII-abstracts-workshops.pdf#page=116&search=%22Rob%20Long%22). in which I discuss my work organizaing genomic/genetic data for Bayer Crop Science, with a massive scale genotype imputation project as the use case. my team of data engineers implemented an imputation engine that takes all varieties of genotype data, along with reference style imputation from Beagle (https://faculty.washington.edu/browning/beagle/beagle.html) and a rich graph of pedigree data (https://plantimals.org/img/maize-galaxy-bcs.png), combined with a novel pedigree-based imputation algorithm to render the best possible view all germplasm given all known genotype observations, in an always-on manner.