Build an ML model
Time to complete: ~10 min (samples) or ~20 min (gold, after ingest)
Before you start
| Path | You need |
|---|---|
| Your data | Gold for ML, status ready |
| Sample | Signed-in account only |
Open the wizard
- Sidebar → Machine Learning (
/ml). - Click New model.
Train from gold (your data)
- Step Data source → under Published dataset (Gold), select your artifact.
- Wait for schema load (target column pre-filled from gold metadata).
- Step Target → confirm column to predict.
- Step Train → name model → optional Advanced (search effort) → Train.
- Training queue tab → open model when status is complete.
No upload step
Gold already passed validation (100+ rows, target, dtypes). The wizard loads from the artifact — you cannot upload CSV here.
Train from sample (explore only)
- Step Data source → scroll below gold picker → Sample datasets.
- Housing (regression,
MEDV) or Iris (classification,Species). - Confirm target → name → Train.
After training
Troubleshooting
| Symptom | Fix |
|---|---|
| Dataset missing from gold list | Data → Gold — confirm ML publish, status ready |
Direct uploads are disabled | Use Lake → gold; only samples bypass Lake |
| Training failed | Check target column; republish gold if schema changed |
| Used zip ingest? | Landed in bronze — still need gold publish |
Next steps
Target column wrong?
Training fails or metrics look wrong. Fix gold metadata and republish if the target changed.