Skip to main content

Heimdall Lake

Heimdall Lake is the managed medallion data catalog on Heimdall by RejiCo — where you ingest raw data (bronze), curate it (silver), and publish modeling-ready gold datasets before ML or Forecast training.

Before you start
  • Sidebar → Data WarehouseLake (/data)
  • Production ML/Forecast requires Gold — Bronze alone is not enough

Medallion layers

StageTab in /dataMeaning
BronzeBronzeRaw upload — CSV, Excel, zip job output
SilverSilverJoins, filters, Lab outputs
GoldGoldValidated for ML or Forecast training

Add data (UI)

Lake page → Add data (top right):

OptionUse for
StructuredCSV / Excel → Bronze
UnstructuredLabeled zip → Bronze table via ingest job
Create Silver datasetJoin/filter Bronze without Lab
DatabaseComing soon — export CSV today

Common paths

GoalFlow
CSV → ML APIBronze → Gold (ML) → Machine Learning wizard → Path A
Profile firstBronze → Lab → Silver → Gold → ML Path C
Time seriesBronze → Gold (Forecast) → Forecast wizard
Image/text corpusUnstructured zip → Bronze → optional Gold → ML

Lake vs ML wizard upload

Lake → GoldML samples only
Your CSV✅ Required
Validation✅ Before trainingSamples only
Reuse in Forecast

Direct CSV upload in the ML wizard is disabled (verified in app API).

FAQ

Unstructured zip vs Read/Vision APIs?
Zip = bulk training data in Lake. Read/Vision = single-request APIs. See Unstructured suite.

Legacy Forge URLs?
/forge redirects to /data?ingest=unstructured. Unstructured ingest is part of Lake.

Do not skip Gold for production

Training without Gold bypasses row-count and schema checks — failures happen mid-train.

Next steps