Part Time
3.75/hour
10
Apr 28, 2026
Python Data Engineer – Build Data Validation Tool (CSV Comparison + AI Summary)
Description:
I’m looking for a strong Python data engineer to build a simple MVP tool that compares two datasets and identifies data mismatches.
This is NOT a chatbot or marketing automation project.
This is a structured data validation and comparison tool.
What the tool should do:
* Accept 2 datasets (CSV/Excel): Source and Target
* Allow simple field mapping (e.g., source.emp_id → target.worker_id)
* Compare data and identify:
  * Missing records
  * Extra records
  * Duplicate records
  * Field mismatches
  * Null / blank values
* Output a clean error report
* Provide summary metrics (% match, total errors, etc.)
* Generate a short AI summary of findings (using OpenAI API)
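To make the scope concrete, here is a minimal sketch of the comparison step in Pandas. It is illustrative only, not a required design: the column names, the FIELD_MAP dictionary, the KEY choice, and the compare() and blank_rows() helpers are all assumptions made for this example. It covers the checks listed above (missing, extra, duplicate, mismatched, and null/blank records) plus a simple match percentage. The resulting dictionary is the kind of structured input that could then be passed to the AI summary step.

```python
import pandas as pd

# Hypothetical field mapping: source column -> target column.
FIELD_MAP = {"emp_id": "worker_id", "name": "full_name", "dept": "department"}
KEY = "emp_id"  # assumed record key, using the source-side name


def blank_rows(df: pd.DataFrame) -> pd.DataFrame:
    """Rows containing at least one null or whitespace-only value."""
    mask = df.apply(lambda col: col.map(lambda v: pd.isna(v) or str(v).strip() == ""))
    return df[mask.any(axis=1)]


def compare(source: pd.DataFrame, target: pd.DataFrame) -> dict:
    # Rename target columns to the source names so both frames line up.
    target = target.rename(columns={t: s for s, t in FIELD_MAP.items()})
    cols = list(FIELD_MAP)
    source, target = source[cols], target[cols]

    # Duplicate keys are reported, then dropped so the join is one-to-one.
    dup_source = source[source.duplicated(KEY, keep=False)]
    dup_target = target[target.duplicated(KEY, keep=False)]
    src = source.drop_duplicates(KEY)
    tgt = target.drop_duplicates(KEY)

    # Outer join; the _merge column says which side each key came from.
    merged = src.merge(tgt, on=KEY, how="outer",
                       suffixes=("_src", "_tgt"), indicator=True)
    missing = merged.loc[merged["_merge"] == "left_only", KEY].tolist()
    extra = merged.loc[merged["_merge"] == "right_only", KEY].tolist()
    both = merged[merged["_merge"] == "both"]

    # Field-by-field comparison for keys present on both sides.
    mismatches = []
    for col in cols:
        if col == KEY:
            continue
        a, b = both[f"{col}_src"], both[f"{col}_tgt"]
        differs = ~((a == b) | (a.isna() & b.isna()))
        for key, sv, tv in zip(both.loc[differs, KEY], a[differs], b[differs]):
            mismatches.append({"key": key, "field": col, "source": sv, "target": tv})

    # A key counts as matched only if it is on both sides with no differing field.
    matched = len(both) - len({m["key"] for m in mismatches})
    total = len(merged)
    return {
        "missing_in_target": missing,
        "extra_in_target": extra,
        "duplicates_source": dup_source,
        "duplicates_target": dup_target,
        "blank_source": blank_rows(source),
        "blank_target": blank_rows(target),
        "mismatches": mismatches,
        "match_pct": round(100 * matched / total, 1) if total else 100.0,
    }


# Made-up sample data, for demonstration only.
source = pd.DataFrame({"emp_id": [1, 2, 3, 3, 4],
                       "name": ["Ann", "Bob", "Cy", "Cy", "Di"],
                       "dept": ["HR", "IT", "IT", "IT", ""]})
target = pd.DataFrame({"worker_id": [1, 2, 4, 5],
                       "full_name": ["Ann", "Rob", "Di", "Eve"],
                       "department": ["HR", "IT", "Ops", "HR"]})
report = compare(source, target)
```

In this sketch, comparison is exact: values that differ only in case, whitespace, or date format would be reported as mismatches, so a real build would likely add a normalization step per field before comparing.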
Tech Requirements:
* Python (Pandas) – REQUIRED
* Experience with data comparison, ETL, or reconciliation
* Ability to build a simple UI (Streamlit preferred)
* Experience integrating APIs (OpenAI is a plus)
What I am NOT looking for:
* Chatbot developers
* Marketing automation specialists (Zapier-only)
* Prompt engineers without data experience
Deliverable:
A working MVP that:
* Runs end-to-end
* Produces accurate validation results
* Has a simple interface for demo purposes
Timeline:
7–10 days for MVP
Budget:
Open to fixed price based on experience
To Apply:
Please include:
1. How you would approach comparing two datasets with different field names and formats
2. Example of similar work (data validation, ETL, reconciliation)
3. The word “validation” in your response (to confirm you read this)
Bonus:
Experience working with HR, payroll, or enterprise datasets is a plus
---
This is a focused MVP build with potential for ongoing work if this goes well.