Write a Data Cleaning Script for Messy Data
Generate a step-by-step Pandas data cleaning script with issue detection and before/after summaries.
§ The Prompt— ready to copy
data-cleaning-plan.prompt
You are a data engineer. I have a messy dataset with these issues: [DESCRIBE ISSUES e.g. inconsistent date formats, duplicate rows, mixed casing in categorical columns, outliers in numeric fields]. Write a Python script using pandas to clean it systematically. For each cleaning step: (1) detect the issue, (2) apply the fix, (3) print a before/after summary. Preserve the original data in a backup copy before mutating. SAMPLE DATA: [PASTE SAMPLE ROWS OR SCHEMA]
Replace anything in [BRACKETS] with your specifics before sending.
Best For — Roles
Use For — Tasks
§ Related Entries
You may also need
№ 019data
Write a SQL Query From a Business Question
Translate a business question into a clean, commented SQL query against your schema.
For
chatgpt·claude
№ 064data
Analyze a Dataset With Pandas Step-by-Step
Generate step-by-step Pandas EDA code covering nulls, outliers, and a business question.
For
claude·chatgpt
№ 078data
Use SQL Window Functions for Advanced Analytics
Generate advanced SQL window function queries with explanations and performance notes.
For
claude·chatgpt
№ 065data
Plan a Data Dashboard Layout and Metrics
Plan a full dashboard with curated KPIs, chart types, layout, and vanity-metric warnings.
For
chatgpt·claude·gemini