Any
90000
40
Nov 12, 2025
BigQuery Data Engineer (Google Cloud / Research Data Validation)
Doctors and researchers generate massive amounts of biological data every week—but most of it never gets fully understood.
Our Proof Computation Framework (PCF) helps them identify stable, reproducible patterns that can shorten research timelines and improve validation across every medical field. Built on Google Cloud and BigQuery, it’s designed to make research data cleaner, faster, and provable.
We’re looking for a BigQuery-first data engineer who can manage real research datasets inside Google Cloud. You’ll handle ingestion, cleaning, and validation for structured datasets from doctors and labs—making sure every table is consistent, documented, and ready for analysis.
This is not a data-entry or dashboard role. It’s technical, precise work focused on integrity and proof.
Responsibilities:
* Ingest and organize datasets (CSV, Parquet, JSON) into BigQuery
* Build efficient SQL queries (joins, aggregations, CTEs, window functions)
* Maintain schema consistency, naming standards, and documentation
* Run Python notebooks for data validation and reproducibility checks
* Manage Google Cloud Storage (GCS) staging and access control
* Produce validation receipts — row counts, timestamps, schema proofs
Required Skills:
* Strong BigQuery experience — CTEs, partitioning, clustering, optimization
* Python (intermediate) — pandas, validation notebooks, data checks
* Google Cloud tools — BigQuery, GCS, IAM basics, Secret Manager
* Version control — Git/GitHub (commits, pull requests, SQL versioning)
* Data reproducibility mindset — everything you do must be verifiable
Nice to Have:
* Experience with Dataform or dbt pipelines
* Background in bioinformatics or scientific data
* Familiarity with Google Cloud DLP and privacy standards
* Understanding of data auditing and proof workflows
We Don’t Need:
* App developers or AI model builders
* Dashboard designers or data-entry assistants
* Guesswork or undocumented work — every operation must be reproducible
Work Setup:
* Full-time preferred (part-time onboarding possible)
* Flexible schedule with U.S. or PH overlap
* Long-term role with growth into Data Validation Lead
* Competitive compensation based on GCP experience
To Apply:
Send:
1. A short intro describing your BigQuery + Python background
2. An example query or validation you’ve written