██FR█████ █INTELL███████████
frenchintelligence.org
dataengineering
🔐 Understanding Governance in Microsoft Fabric
December 2, 2025
🔐 Understanding Governance in Microsoft Fabric
December 2, 2025
Clean Code in ETL:How Python, Go, and SQL Each Teach You to Think Differently
December 2, 2025
Clean Code in ETL:How Python, Go, and SQL Each Teach You to Think Differently
December 2, 2025
Building a Data Platform on AWS: Essential Design Considerations for Power BI
December 1, 2025
Building a Data Platform on AWS: Essential Design Considerations for Power BI
December 1, 2025
Part 1: Snowflake’s Autonomous Future
November 18, 2025
Why Your Snowflake Bill is High and How to Fix It with a Hybrid Approach
November 15, 2025
From Pandas to Upstream Control: The Evolution PyData Needs Next
November 12, 2025
From Pandas to Upstream Control: The Evolution PyData Needs Next
November 12, 2025
How We Cut LLM Batch Inference Time in Half with Dynamic Prefix Bucketing
November 10, 2025
How We Cut LLM Batch Inference Time in Half with Dynamic Prefix Bucketing
November 10, 2025
How We Cut LLM Batch Inference Time in Half with Dynamic Prefix Bucketing
November 10, 2025
From 30 Minutes to 5: Solving Data Pipeline Deployment Bottlenecks with Git Sparse Checkout
November 6, 2025
From 30 Minutes to 5: Solving Data Pipeline Deployment Bottlenecks with Git Sparse Checkout
November 6, 2025
1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL
November 4, 2025
How to Data Engineer the ETLFunnel Way
November 1, 2025
How to Data Engineer the ETLFunnel Way
November 1, 2025
How to Data Engineer the ETLFunnel Way
November 1, 2025
Designing Data-Intensive Applications — Chapter 1: Reliable, Scalable, and Maintainable Applications
October 29, 2025
End-to-End Data Workflow: Kestra, Redshift, and dbt Integration
October 29, 2025
End-to-End Data Workflow: Kestra, Redshift, and dbt Integration
October 29, 2025
Fixing Type Hints for Callable Objects with Custom Signatures in Dagster
October 28, 2025
Set up an open-source AI analyst for PostgreSQL in 2 minutes
October 24, 2025
Evolution of Processing: SPL One-Click Acceleration for Log-to-Metric Conversion
October 22, 2025
An Exploration of the Commercial Iceberg Catalog Ecosystem
October 21, 2025
My First Data Engineering Project: Building a Real-Time IoT Pipeline on Azure
October 20, 2025
Data Engineering 102: Understanding Transactions, ACID, and Isolation in PostgreSQL
October 20, 2025
Orchestrating and Observing Data Pipelines with Airflow, PostgreSQL, and Polar
October 15, 2025
The State of Apache Iceberg v4 – October 2025 Edition
October 14, 2025
Making JSON Compression Searchable — SEE (Schema-Aware Encoding)
October 12, 2025
Lessons Learned from Building Product Dashboards That Drive Real Decisions
October 12, 2025
Lessons Learned from Building Product Dashboards That Drive Real Decisions
October 12, 2025
Understanding the Basics of Linux Operating System
October 11, 2025
Understanding the Basics of Linux Operating System
October 11, 2025
Building Real-Time Data Pipelines from PostgreSQL Using Flink CDC
October 4, 2025
Personal Picks: Data Product News (October 1, 2025)
October 1, 2025
Streams de Dados: Processamento de Informações em Tempo Real
September 28, 2025
Apache Kafka — Deep Dive: Core Concepts, Data-Engineering Applications, and Real-World Production Practices
September 25, 2025
The Ultimate Guide to Open Table Formats: Iceberg, Delta Lake, Hudi, Paimon, and DuckLake
September 25, 2025
Usando Funções de Ordem Superior (Higher-Order Functions – HOFs)
September 25, 2025
How to Pass the AWS Certified Data Engineer – Associate Exam
September 24, 2025
Apache Kafka & Amazon MSK: The Beating Heart of Real-Time Data
September 20, 2025
Apache Iceberg
September 19, 2025
All About Change Data Capture CDC
September 16, 2025
Automating Your Local DBT & Snowflake Playground with Python
September 16, 2025
Why I’m Switching to Parquet for Data Storage
September 15, 2025
Change Data Capture (CDC) in Data Engineering: Concepts, Tools, and Real-World Implementation Strategies
September 14, 2025
Change Data Capture (CDC) in Data Engineering: Concepts, Tools, and Real-World Implementation Strategies
September 14, 2025
Apache Kafka Deep Dive: Core Concepts, Data Engineering Applications, and Real-World Production Practices
September 13, 2025
Nifi Bundle Release Announcement
September 12, 2025
Column-Oriented Databases: A Technical Overview
September 12, 2025
Aggregation Strategies for Scalable Data Insights: A Technical Perspective
September 10, 2025
How We Use OpenAI and Gemini Batch APIs to Qualify Thousands of Sales Leads
September 9, 2025
Apache Kafka Deep Dive: Concepts, Applications, and Production
September 8, 2025
Apache Kafka Deep Dive: Concepts, Applications, and Production
September 8, 2025
Why Apache Airflow is the Cornerstone of Modern Data Engineering
September 7, 2025
Zero-Downtime Database Migration: The Definitive Guide
September 7, 2025
Zero-Downtime Database Migration: The Definitive Guide
September 7, 2025
Zero-Downtime Database Migration: The Definitive Guide
September 7, 2025
Apache Arrow dev list digest (Aug 25–29 2025)
September 4, 2025
Apache Arrow dev list digest (Aug 25–29 2025)
September 4, 2025
Revamping Real-Time Data Ingestion for Scalable Media Intelligence
September 4, 2025
Two Years of Microsoft Fabric: Game Changer or Still Leveling Up? 🚀
September 3, 2025
Dynamic Routing Lightweight ETL with AWS Lambda, DuckDB, and PyIceberg
September 2, 2025
Data Mesh: The Decentralized Revolution That Will Transform Your Data Architecture
September 1, 2025
Check Out 3 Awesome Open Source Tabular Data Wrangling Apps
August 29, 2025
Good work George
August 28, 2025
Good work George
August 28, 2025
Why We Built Confidence Scoring Into Our Date Parser (And Why Every API Should)
August 28, 2025
Data Modeling: From Basics to Advanced Techniques for Business Impact
August 26, 2025
Is Prompt Engineering Just Hype for Now?
August 23, 2025
Lightweight ETL with AWS Lambda, DuckDB, and delta-rs
August 22, 2025
Personal Picks: Data Product News (August 20, 2025)
August 20, 2025
Tableau Sales Dashboard Performance (Updated for 2025)
August 19, 2025
Build a Lightweight Serverless ETL Pipeline to Iceberg Tables with AWS Lambda Athena
August 19, 2025
Building ML Infrastructure in TypeScript – Part 1: The Vision
August 13, 2025
Engineering with SOLID, DRY, KISS, YAGNI and GRASP
August 13, 2025
15 foundational concepts on Data Engineering
August 12, 2025
[Boost]
August 12, 2025
Core Concepts of Data Engineering: A Practical Guide for Modern Data Teams
August 12, 2025
The Case for Apache Airflow and Kafka in Data Engineering
August 11, 2025
Snowflake RBAC 101
August 11, 2025
A Recap of Data Engineering Concepts
August 11, 2025
Docker Persistence: When and How to Keep Your Container Data
August 9, 2025
What Is a Primary Key in SQL? Learn with Examples
August 8, 2025
What Is a Primary Key in SQL? Learn with Examples
August 8, 2025
AI-Powered Data Engineering Pipelines: Smarter, Faster, Scalable
August 8, 2025
Building My First Production-Ready ELT Pipeline: A Student’s Journey with Docker, PostgreSQL, dbt, and Airflow
August 7, 2025
Is your Vector Database Really Fast?
July 22, 2025
SQL Server 2025 – What’s New and How to Visualize the Schema
July 18, 2025
Apache Iceberg Table Optimization #10:
July 17, 2025
Apache Iceberg Table Optimization #9:
July 17, 2025
Apache Iceberg Table Optimization #8: Hidden Pitfalls — Compaction and Partition Evolution in Apache Iceberg
July 17, 2025
Apache Iceberg Table Optimization #7: Using Iceberg Metadata Tables to Determine When Compaction Is Needed
July 17, 2025
Apache Iceberg Table Optimization #5: Avoiding Metadata Bloat with Snapshot Expiration and Rewriting Manifests
July 17, 2025
Apache Iceberg Table Optimization #4: Smarter Data Layout — Sorting and Clustering Iceberg Tables
July 17, 2025
Apache Iceberg Table Optimization #3: Optimizing Compaction for Streaming Workloads in Apache Iceberg
July 17, 2025
Apache Iceberg Table Optimization #2: The Basics of Compaction — Bin Packing Your Data for Efficiency
July 17, 2025
Apache Iceberg Table Optimization #1: The Cost of Neglect — How Apache Iceberg Tables Degrade Without Optimization
July 17, 2025
1
2
→