Andrea Bozzo

I build data infrastructure in Rust, Python, and Go.

Open-source tools, lakehouse experiments, and engineering notes from the places where pipelines, storage, and developer tooling meet.

Selected proof

The shortest path through the work.

Start here if you want the clearest signal: one built tool, one upstream contribution track, and one writing archive.

Built system

dataprof

Arrow-native profiling in Rust with CLI and Python surfaces, designed for bounded-memory data quality workflows.

Read the case study
Upstream work

Apache Rust contributions

Public PRs across Arrow, DataFusion, Iceberg Rust, and Fluss Rust, tied back to real downstream constraints.

Open the contribution track
Technical writing

Bilingual engineering notes

Long-form articles on data platforms, Rust/Python systems, lakehouse tradeoffs, and open-source project notes.

Browse the archive
Published packages
Committed package registry data
CI health
Committed GitHub Actions runtime data
Dataset index
Committed dataset metadata
Writing archive
Committed bilingual writing index
Upstream repos
Committed contribution index

System View

This site is also a small platform.

The public surface is intentionally hand-built, but the repository behind it is a real delivery system: static homepage, Hugo archive, generated work pages, Rust/WASM workbench, Go harvester, and a Vercel companion API.

Architecture diagram connecting the landing page, Hugo blog, Rust and WebAssembly workbench, Go harvester, GitHub Pages, and Vercel companion API
One repository, two delivery surfaces: GitHub Pages for the static site and Vercel for live GitHub metrics.
Static front door

The landing page is plain HTML, CSS, and JavaScript so the public surface stays lightweight and explicit.

Content + computation

Hugo handles the writing archive, while the workbench logic is mirrored between JavaScript and Rust compiled to WebAssembly.

Generated archive

Go generators turn structured JSON and repository data into case-study pages, contribution cards, and static artifacts.

Live companion

GitHub Pages serves the static site, and Vercel only carries the live GitHub stats and badge endpoints.

Workbench

Search the whole websuite

One input for case studies, blog posts, open-source work, reviewed papers, and the technical threads that connect them.

    Journal

    Blog

    Longer writeups, project notes, and experiments live here.

    Browse every post

    Latest writing

    Writing language

    Open Source

    Open source work

    A few projects I have sent patches to. The list is pulled from the repository README.

    Reviews & Papers

    Reviewed papers and companion repos

    Public material for two IEEE paper submissions, with benchmarks, demos, and reproducible companion assets.

    Contact

    Business inquiries and recruiting

    For freelance data infrastructure work, recruiting context, or technical follow-up, email is the cleanest first step.