██FR█████ █INTELL███████████
frenchintelligence.org
dataengineering
Orchestrating and Observing Data Pipelines with Airflow, PostgreSQL, and Polar
October 15, 2025
The State of Apache Iceberg v4 – October 2025 Edition
October 14, 2025
Making JSON Compression Searchable — SEE (Schema-Aware Encoding)
October 12, 2025
Lessons Learned from Building Product Dashboards That Drive Real Decisions
October 12, 2025
Lessons Learned from Building Product Dashboards That Drive Real Decisions
October 12, 2025
Understanding the Basics of Linux Operating System
October 11, 2025
Understanding the Basics of Linux Operating System
October 11, 2025
Building Real-Time Data Pipelines from PostgreSQL Using Flink CDC
October 4, 2025
Personal Picks: Data Product News (October 1, 2025)
October 1, 2025
Streams de Dados: Processamento de Informações em Tempo Real
September 28, 2025
Apache Kafka — Deep Dive: Core Concepts, Data-Engineering Applications, and Real-World Production Practices
September 25, 2025
The Ultimate Guide to Open Table Formats: Iceberg, Delta Lake, Hudi, Paimon, and DuckLake
September 25, 2025
Usando Funções de Ordem Superior (Higher-Order Functions – HOFs)
September 25, 2025
How to Pass the AWS Certified Data Engineer – Associate Exam
September 24, 2025
Apache Kafka & Amazon MSK: The Beating Heart of Real-Time Data
September 20, 2025
Apache Iceberg
September 19, 2025
All About Change Data Capture CDC
September 16, 2025
Automating Your Local DBT & Snowflake Playground with Python
September 16, 2025
Why I’m Switching to Parquet for Data Storage
September 15, 2025
Change Data Capture (CDC) in Data Engineering: Concepts, Tools, and Real-World Implementation Strategies
September 14, 2025
Change Data Capture (CDC) in Data Engineering: Concepts, Tools, and Real-World Implementation Strategies
September 14, 2025
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices
September 13, 2025
Nifi Bundle Release Announcement
September 12, 2025
Column-Oriented Databases: A Technical Overview
September 12, 2025
Aggregation Strategies for Scalable Data Insights: A Technical Perspective
September 10, 2025
How We Use OpenAI and Gemini Batch APIs to Qualify Thousands of Sales Leads
September 9, 2025
Apache Kafka Deep Dive: Concepts, Applications, and Production
September 8, 2025
Apache Kafka Deep Dive: Concepts, Applications, and Production
September 8, 2025
Why Apache Airflow is the Cornerstone of Modern Data Engineering
September 7, 2025
Zero-Downtime Database Migration: The Definitive Guide
September 7, 2025
Zero-Downtime Database Migration: The Definitive Guide
September 7, 2025
Zero-Downtime Database Migration: The Definitive Guide
September 7, 2025
Apache Arrow dev list digest (Aug 25–29 2025)
September 4, 2025
Apache Arrow dev list digest (Aug 25–29 2025)
September 4, 2025
Revamping Real-Time Data Ingestion for Scalable Media Intelligence
September 4, 2025
Two Years of Microsoft Fabric: Game Changer or Still Leveling Up? 🚀
September 3, 2025
Dynamic Routing Lightweight ETL with AWS Lambda, DuckDB, and PyIceberg
September 2, 2025
Data Mesh: The Decentralized Revolution That Will Transform Your Data Architecture
September 1, 2025
Check Out 3 Awesome Open Source Tabular Data Wrangling Apps
August 29, 2025
Good work George
August 28, 2025
Good work George
August 28, 2025
Why We Built Confidence Scoring Into Our Date Parser (And Why Every API Should)
August 28, 2025
Data Modeling: From Basics to Advanced Techniques for Business Impact
August 26, 2025
Is Prompt Engineering Just Hype for Now?
August 23, 2025
Lightweight ETL with AWS Lambda, DuckDB, and delta-rs
August 22, 2025
Personal Picks: Data Product News (August 20, 2025)
August 20, 2025
Tableau Sales Dashboard Performance (Updated for 2025)
August 19, 2025
Build a Lightweight Serverless ETL Pipeline to Iceberg Tables with AWS Lambda Athena
August 19, 2025
Building ML Infrastructure in TypeScript – Part 1: The Vision
August 13, 2025
Engineering with SOLID, DRY, KISS, YAGNI and GRASP
August 13, 2025
15 foundational concepts on Data Engineering
August 12, 2025
[Boost]
August 12, 2025
Core Concepts of Data Engineering: A Practical Guide for Modern Data Teams
August 12, 2025
The Case for Apache Airflow and Kafka in Data Engineering
August 11, 2025
Snowflake RBAC 101
August 11, 2025
A Recap of Data Engineering Concepts
August 11, 2025
Docker Persistence: When and How to Keep Your Container Data
August 9, 2025
What Is a Primary Key in SQL? Learn with Examples
August 8, 2025
What Is a Primary Key in SQL? Learn with Examples
August 8, 2025
AI-Powered Data Engineering Pipelines: Smarter, Faster, Scalable
August 8, 2025
Building My First Production-Ready ELT Pipeline: A Student’s Journey with Docker, PostgreSQL, dbt, and Airflow
August 7, 2025
Is your Vector Database Really Fast?
July 22, 2025
SQL Server 2025 – What’s New and How to Visualize the Schema
July 18, 2025
Apache Iceberg Table Optimization #10:
July 17, 2025
Apache Iceberg Table Optimization #9:
July 17, 2025
Apache Iceberg Table Optimization #8: Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg
July 17, 2025
Apache Iceberg Table Optimization #7: Using Iceberg Metadata Tables to Determine When Compaction Is Needed
July 17, 2025
Apache Iceberg Table Optimization #5: Avoiding Metadata Bloat with Snapshot Expiration and Rewriting Manifests
July 17, 2025
Apache Iceberg Table Optimization #4: Smarter Data Layout — Sorting and Clustering Iceberg Tables
July 17, 2025
Apache Iceberg Table Optimization #3: Optimizing Compaction for Streaming Workloads in Apache Iceberg
July 17, 2025
Apache Iceberg Table Optimization #2: The Basics of Compaction — Bin Packing Your Data for Efficiency
July 17, 2025
Apache Iceberg Table Optimization #1: The Cost of Neglect — How Apache Iceberg Tables Degrade Without Optimization
July 17, 2025
Data and analytics reimagined with Terraform and DevOps principles
July 16, 2025
Big Data Fundamentals: data pipeline tutorial
July 15, 2025
Big Data Fundamentals: data pipeline tutorial
July 15, 2025
Big Data Fundamentals: data lake
July 10, 2025
Big Data Fundamentals: delta lake example
July 9, 2025
Personal Picks: Data Product News (July 9, 2025)
July 9, 2025
How to Discover or Organize Lakehouse & Apache Iceberg Meetups
July 3, 2025
Big Data Fundamentals: big data tutorial
June 29, 2025
Big Data Fundamentals: big data tutorial
June 28, 2025
The Myth of Sisyphus in Data Engineering
June 27, 2025
How to Document SQL Server Schemas Visually in 2025
June 26, 2025
How to Document SQL Server Schemas Visually in 2025
June 26, 2025
How to Document SQL Server Schemas Visually in 2025
June 26, 2025
How to Document SQL Server Schemas Visually in 2025
June 26, 2025
Data Engineering: The Hero Behind Smart Data Decisions
April 3, 2025
Why Pi-Shaped Teams Matter in This AI Era
March 19, 2025
How fault-tolerance works in Flink
March 16, 2025
Azure For Data Engineering
March 15, 2025
Data Modeling – Entities and Events
October 30, 2024
Análise de dados de tráfego aéreo em tempo real com Spark Structured Streaming e Apache Kafka
October 28, 2024
Data Engineering in 2024: Innovations and Trends Shaping the Future
October 27, 2024
From a Unified Bronze Layer to Multiple Silver Layers: Streamlining Data Transformation in Databricks Unity Catalog
October 20, 2024
*Mastering Informatica Intelligent Cloud Services (IICS) for Cloud Data Integration*
October 18, 2024
Handling Outliers in Python – IQR Method
October 10, 2024
Handling Outliers in Python – IQR Method
October 10, 2024
Go vs Python for File Processing: A Performance and Architecture Perspective
October 9, 2024
End-to-End ETL and Sales Dashboard on WWI dataset in Microsoft Fabric
October 8, 2024
Ultimate Directory of Apache Iceberg Resources
October 5, 2024
1
2
→