Career Profile

Life science data aficionado with a product mindset and 7 years of experience working with life science data platforms. With over a decade of research experience in molecular biology and genomics, I understand the needs of scientific teams and serve as a liaison between users, stakeholders, and engineers.

Experience

Senior Scientific Knowledge Engineer

May 2023 - Present
GSK

Product Lead for single cell genomics platform, leading a cross-functional team of data engineers and computational biologists. Lead data harmonization efforts and provide data modeling support for the Onyx Research Data Platform, serving research and early discovery teams.

  • Develop conceptual and logical data models tailored to scientific use cases and business needs
  • Lead omics harmonization to internal schemas, collaborating with users to prioritize most important fields
  • Communicate business priorities and scientific use cases to engineers, bridging the gap between scientific and technical domains of expertise
  • Extension of internal ontologies and code mappings to support clearly defined business needs

Senior Data Analyst

May 2022 - Feb 2023
Velsera (formerly Seven Bridges Genomics)

Data modeling to support a cohort browser for patient health data. My efforts focused on evaluating standardized clinical data models and providing expertise needed to support product strategy, as well as mapping and integration with complex variant information and omics data.

  • Evaluation of OMOP CDM for internal use cases
  • Led PoC for a data obfuscation tool that reduced complexity of testing workflow
  • Member of OHDSI Genomics Working Group

Senior Data Curator

Jan 2021 - May 2022
Harvard Medical School

Data wrangling and product management for the Computational Genome Analysis Platform project, an end-to-end platform for clinical genomics workflows from raw sequencing data to variant interpretation. I led the data model design and managed data submissions to the platform, and planned and defined new platform features.

  • Data model design
  • Onboarded new users and projects
  • Software development for data ingestion and mapping to data model
  • Product management of data submission tool and variant interpretation module
  • Led interviews with clinicians to understand user needs
  • Define product requirements for software developers

Data Curator

Mar 2018 - Jan 2021
Harvard Medical School

Data wrangling for the 4D Nucleome Data Coordination and Integration Center, a repository for genomics and microscopy experimental data and metadata. I work with a large consortium of 100+ labs to ingest new datasets, implement data standards, and adhere to FAIR principles. I use REST APIs with python to manage data and metadata, and contribute to data model and software development.

  • Data model design
  • Managed new data submissions
  • Scientific publication and user-facing documentation
  • Optimized ontology processing code and submitted terms to public ontologies
  • Participated in consortium working groups to set genomics and microscopy data standards
  • Member of Common Fund Data Ecosystem Ontology Working Group

Postdoctoral Scholar

2012 - 2018
Laboratory of Dr. Wallace Marshall, UCSF

I studied the large ciliate Stentor coeruleus, a model for single cell regeneration. I was involved in the Stentor genome project, and investigated the role of CDKs in regeneration of cellular features.

  • Managed collaborations with scientists at other institutions
  • Generated and analyzed RNA-Seq data
  • Discovered Stentor’s unusual 15 bp intron size
  • Used HMMs and molecular evolution approaches to classify Stentor kinases into families
  • Mentored graduate students

PhD Student

2006 - 2012
Laboratory of Dr. Boris Striepen, UGA

I studied the maintenance and replication of the plastid genome in the Apicomplexan parasite Toxoplasma gondii. I constructed conditional genetic mutants and used fluorescence microscopy and qPCR assays to show that the histone-like HU protein is essential for apicoplast DNA replication and proper plastid segregation.

Undergraduate Researcher

2004 - 2006
Laboratory of Dr. Michele Klingbeil, UMass Amherst

I examined the function of the mtDNA polymerase PolIA in the parasite Trypanosoma brucei. I used RNAi to knock down the gene and designed assays with DNA damaging agents to probe a function in mtDNA repair.

Summer REU Intern

2005
Laboratory of Dr. Joseph Jerry, UMass Amherst

I was investigating the role of p53 in DNA double strand break repair. I worked with mice and measured rates of homologous recombination and non-homologous end-joining in p53+/+, p53+/-, and p53-/- mouse cells.

Skills

Python

Excellent written and oral communication

SQL

Use of Cloud environments (AWS, GCP)

Genomics and microscopy data standards

Use of HPC Environments (Slurm)

Biomedical data modeling

Product requirement definition

Genomics data analysis

Extensive research experience

Biomedical ontologies and OWL/RDF

Engagement with End Users

Use of REST APIs

Detail-oriented

Publications

Laß, J., Berselli, M., Rioux, D., Schaake, S., Follett, J., Bravo, J.E., Veit, A.D., Ronchetti, W., Reiff, S.B., Huentelman, M.J., Vuzman, D., Bower, P., Park, P.J., Khurana, V., Trinh, J., Jeon, B., Kim, H., and Farrer, M.J. (2024) Genetic testing for SCA27B in Korean Multiple System Atrophy. medRxiv doi: 10.1101/2024.10.21.24315855

Reiff, S.B., Schroeder, A.J., Kırlı, K., Cosolo, A., Bakker, C., Mercado, L., Lee, S., Veit, A.D., Balashov, A.K., Vitzthum, C., Ronchetti, W., Pitman, K.M., Johnson, J., Ehmsen, S.R., Kerpedjiev, P., Abdennur, N., Imakaev, M., Öztürk, S.U., Çamoğlu, U., Mirny, L.A., Gehlenborg, N., Alver, B.H., and Park, P.J. (2022) The 4D Nucleome Data Portal as a resource for searching and visualizing curated nucleomics data. Nat Comms 13: 2365. doi: 10.1101/2021.10.14.464435

Martins-Duarte, E.S., Sheiner, L., Reiff, S.B., de Souza, W., and Striepen, B. (2021) Replication and partitioning of the apicoplast genome of Toxoplasma gondii is linked to the cell cycle and requires DNA polymerase and gyrase. Int J Parasitol 51(6): 493-504. doi: 10.1016/j.ijpara.2020.11.004

Reiff, S.B. and Marshall, W.F. (2017) A large kinome in a large cell: Stentor coeruleus possesses highly expanded kinase families and novel domain architectures. bioRxiv doi: 10.1101/168187

Slabodnick, M.M.*, Ruby, J.G.*, Reiff, S.B.*, Swart, E.C.*, Gosai, S., Prabakaran, S., Witkowska, E., Larue, G.E., Fisher, S., Freeman, R.M. Jr., Gunawardena, J., Chu, W., Stover, N.A., Gregory, B.D., Nowacki, M., DeRisi, J., Roy, S.W., Marshall, W.F., and Sood, P. (2017) The Macronuclear Genome of Stentor coeruleus Reveals Tiny Introns in a Giant Cell. Curr. Biol. 27(4): 569-575. (* - equal contributions)

Reiff, S.B. and Marshall, W.F. (2015) Stentor, Its Cell Biology and Development. eLS. doi: 10.1002/9780470015902.a0025978 (Review)

Reiff, S.B., Vaishnava, S, and Striepen, B. (2012) The HU Protein is Important for Apicoplast Genome Maintenance and Inheritance in Toxoplasma gondii. Eukaryot. Cell 11(7): 905-915.

Reiff, S.B. and Striepen, B. (2009) Malaria: The Gatekeeper Revealed. Nature 459(7249): 918-919. (Commentary)

van Dooren, G.G., Reiff, S.B., Tomova, C., Meissner, M., Humbel, B.M., and Striepen, B. (2009) A Novel Dynamin-Related Protein Has Been Recruited for Apicoplast Fission in Toxoplasma gondii. Curr. Biol. 19(4): 267-276.

Striepen, B., Jordan, C., Reiff, S., and van Dooren,G.G. (2007) Building the Perfect Parasite: Cell Division in the Apicomplexa. PLoS Path. 6(3): e78. (Review)

Selected Presentations

4DN Annual Meeting - Live Demo of 4DN Visualization Workspace

2020

International Biocuration Conference

2019

American Society for Cell Biology Annual Meeting

2016

The Allied Genetics Conference

2016

Workshops & Training

Product Management Training - Productside

2024

Introduction to Machine Learning with R - Harvard Extension School

2020

Natural Language Processing and Text Analytics - Harvard Extension School

2019

Advanced Python for Data Science - Harvard Extension School

2018

Workshop for Molecular Evolution – MBL, Woods Hole, MA

2016

Intermediate Bioinformatics Techniques – UC Davis, Bodega Bay, CA

2016

Software Carpentry – UC Davis, Davis, CA

2015

Next Gen. Sequencing Data Analysis Course – MSU, Lansing, MI

2013

Fellowships & Awards

GSK Bronze Award - $500

2024

University of Georgia Presidential Graduate Fellowship

2006 - 2011