resume

Employment History

Syapse      May 2018 - June 2023

Engineering Manager / Principal Engineer

Knowledge management team

Created and managed the knowledge management group that radically improved data quality and created a fully integrated closed loop curated and measurable normalized data pipelines supervised by clinical informaticists and subject matter experts. A hands on member of the team, created and maintained many of the core code and technologies that power the assets of this group.

Developed and maintained a full end to end curated knowledge management system for medical terminologies and inference based on industry standards combined with in-house expert input (clinical informatics, oncologists, variant scientists, clinicians, pharmacists, ontologists, etc.) Applied knowledge models to raw patient medical data.

Some of the challenges tackled:

Clinical, genomic testing, cancer registry, and medication data normalization: raw data comes in from multiple health systems, in-house and outside labs (clinical and sequencing labs), cancer registries, Syapse manual abstractors. The raw data does not conform to uniform standards, comes in many formats, suffers from missingness, and variations in coding. The data is not fit for aggregation and analytics, and has high volume.

Knowledge management (KMS): Cancer is a complex domain, and each cancer has its peculiar aspects. No single expert knows everything. Managing, coordinating, reviewing, and verifying knowledge in a highly inter-disciplinary environment is challenging.

Complex inference: even when you can normalize basic data much remains either missing or too complex to normalize by simple means.

Mortality data: for survival studies, Kaplan-Meier curves, and other critical studies in treatments and pharmaceuticals accurate survival and date of death data is critical. But outside of controlled studies, in real world patients - reliable data is hard to come by.

Data pipeline innovation: The team and I also developed a highly scaleable and cost effective data processing, normalization, and projection system for creating flexible and adaptive end to end analytical pipelines from raw data through normalization and inference, through analytics through final product. This pipeline leverages high partitioning, S3/parquet based storage, integration with the KMS APIs, local and Redis based caching, real time monitoring, execution framework independence, lazy evaluation, schema free.

Key technologies in use:

Key health care standards and practices

Leader, mentor, hands-on developer: delivering every day

github activity 2023


Udemy      May 2017 - 2018

Principal Engineer


Visa Inc.      2013 - 2017

Chief Architect


Previously

A long history in startups in the Silicon Valley in engineering positions ranging from IC, to director, to VP of engineering, to owner and operator. Two major exists. Span of control from 4 to 80 people.

Skills Older Employment History