Day 1 of 12: Understanding Key Terms for Data Professionals
Data Warehouse, Data Lake, Data Lakehouse – What's the Difference?
- Data Warehouse: Centralized storage for structured data, widely used since the 1990s. Perfect for reporting and analysis.
- Data Lake: Designed to handle unstructured and semi-structured data - developed in response to the limitations of data warehouses.
- Data Lakehouse: A modern hybrid solution combining the best of both worlds—efficient querying of structured data and flexibility for unstructured data in a single system.
💻Today's Small Practical Project:
Create a mini data lake with AWS S3: Upload JSON or CSV data to an S3 bucket, then process the data with Python and perform data analysis with Pandas, for example.
#data #datascience #dataengineering #programming #python
To learn more: https://medium.com/towards-data-science/the-concepts-data-professionals-should-know-in-2025-part-1-47e7e797801d
I am starting an email newsletter about time series analysis and forecasting. It is still WIP, but you can subscribe here:
https://the-forecaster.beehiiv.com/subscribe
#timeseries #forecasting #datascience #RStats #Python
Python recap for week 3/2025
https://discu.eu/weekly/python/2025/3/
Get RSS feeds and support this bot with the premium plan: https://discu.eu/premium
📅 Tomorrow at 10am ET! Tune in to @TalkPython To Me Podcast with @calvinhp and @mkennedy! 🎙️ They’ll explore Scaf™, the complete blueprint for new #Python #Kubernetes projects.
👉 Watch live on YouTube: https://loom.ly/CylR5W8
#Productivity #DevOps #ScafChallenge
In addition to that, its module-based intuitive UI to design DAQ schemes and integration of native #Python allows it to be extended easily for a wide array of data-acquisition and closed-loop intervention tasks. For many experiments, no programming skills are needed at all! 3/x
running python script permanently in background , nohup is not doing the job? #python
https://askubuntu.com/q/1538687/612
fruitstand: A Library for Regression Testing LLMs
https://github.com/deckard-designs/fruitstand
Discussions: https://discu.eu/q/https://github.com/deckard-designs/fruitstand
#Exercitium: Primos consecutivos con media capicúa. https://jaalonso.github.io/exercitium/posts/2025/01/20-primos_consecutivos_con_media_capicua/ #Haskell #Python #Matemáticas
🐍 On this Building SaaS with #Python and #Django, I'm building a new feature that allows users to do bulk deletion on tasks that they create for their students' courses. https://www.youtube.com/watch?v=heWgYkMh1bw
Grille de quadrilatères et leurs homothétiques.
#geogebra + #python #pyggb #homothétie
https://www.geogebra.org/m/r3kyd6jh
Fetch tweets into Sheets with Python 🐦📱🔥
Curate content or listen socially! 🌟
#GoogleSheets #Python #TwitterAPI #SocialMedia