ETL is Inferior
ELT > ETL
Pipelines are necessary and transformation is where data becomes useful. The debate is not whether those steps matter, but when they should happen.
Extract. Transform. Load.
That middle step is where the crime happens. You pull data from some source system, immediately start "cleaning" it, drop fields you think nobody needs, coerce types, flatten structures, rename things, dedupe records, and then finally write the polished version somewhere durable.
Congrats, you built a pipeline that can only answer the questions you predicted ahead of time.
The better default is ELT:
Extract. Load. Transform.
Get the data. Store the raw thing. Then transform it after you have preserved reality.



