ETL/ELT Tools

Discussions center on ETL/ELT data integration tools like dbt, Fivetran, Stitch, and their use with cloud data warehouses such as Snowflake, including comparisons, recommendations, and data engineering workflows.

📉 Falling 0.4x Databases
3,841
Comments
20
Years Active
5
Top Authors
#9917
Topic ID

Activity Over Time

2007
1
2008
2
2009
5
2010
15
2011
25
2012
32
2013
41
2014
27
2015
124
2016
123
2017
136
2018
177
2019
288
2020
433
2021
580
2022
501
2023
502
2024
447
2025
362
2026
20

Keywords

HTAP e.g S3 DE AWS DBT BigQuery tower.dev BI CEO data etl snowflake analytics tools warehouse tools like bi data engineering new data

Sample Comments

forgetfulness Jan 16, 2021 View on HN

It's a bit unclear on what the scope of this tool is.But there are a couple of new classes of tools for ETL/ELT or data engineering as it's called now.There's the "Data Integration Tools" like Fivetran, Stitch, and this. They are collections of connectors that they have coded to ease ingesting data from lots of different database products and stores to another. That's valuable, I wouldn't start writing my own script to pull changes from my RDBMS'

BrentBrewington Jul 28, 2023 View on HN

You should definitely check out dbt :)Some links in my comment: https://news.ycombinator.com/item?id=36911937

punknight Jul 18, 2020 View on HN

Thanks for the feedback! We are looking for more info on user needs in this space. Sounds like you currently use Alteryx + Snowflake. Any additional information you could provide about your use case/needs would be helpful. Seems like some people are more interested in open source tools that can be run on their own computer (like DBT) while others are looking for more of an enterprise use case. What about you?

vitorbaptistaa Nov 12, 2019 View on HN

Luigi, AWS S3, DBT, Snowflake and Re:dash (currently analyzing Metabase or Looker to allow queries without SQL)Luigi runs our scrapers and other workflow management tasks (e.g. DB backups).All raw data lives in S3. We make an effort to be able to recreate the whole data warehouse from the raw data, so if any cleaning/normalization process fails, we have this safety net. I'm curious to hear if others use a similar pattern, or if there are better options.DBT handles both loading

acidbaseextract Oct 23, 2020 View on HN

If you're an analyst, I second the recommendation for dbt. Here's a podcast interview with the CEO of the company behind dbt that explains a lot of the philosophy, and I think will help you even if you don't end up using dbt: https://softwareengineeringdaily.com/2020/03/09/dbt-data-bui...

entee Jan 13, 2021 View on HN

You’re missing the need this product addresses. It’s not about your database, it’s about the data someone sent you that you’re about to ETL into your database.I used to work at a medical ML company, the datasets we got from insurance companies and medical providers were generally awful. Occasionally, it didn’t even match the data dictionary they themselves provided. If you need to connect to or ingest outside data sources or check the output of an ETL pipeline, this tool is extremely useful.

mritchie712 Jul 14, 2024 View on HN

Many BI / analytics tools don't have great support for Data Lakes, so part of the reason could be supporting those tools (e.g. they still load some of their data to snowflake to power BI / dashboards)

endlessvoid94 Jan 9, 2023 View on HN

How do you build a data warehouse without ETL?

gravitronic Jun 10, 2016 View on HN

from any data warehouse as well?

carlineng May 1, 2020 View on HN

Excited to follow your progress! I view this problem as the one of the biggest gaps in today's "Cloud Data Ecosystem". Tools like Stitch and Fivetran make it super easy to extract data from source systems; next-gen cloud data platforms like Snowflake make storing, transforming, and querying that data a breeze (especially with the help of tools like dbt and dataform); and there are a ton of powerful and easy to use BI tools for visualizing and digesting that data. But the minute yo