Now

May 2026. Current technical focus for Zach Olivier.

Working

Post-training harnesses for SFT, DPO, verifier scoring, and best-of-N selection.
Telemetry retention for ML systems: logging budgets, summary interfaces, and failure analysis.
ALife experiments: replicators, program soups, mutation search, and population traces.
Tabular models: foundation-model baselines, uncertainty, runtime, and training dynamics.

Testing

Evaluation signals under distribution shift.
Benchmark design that separates model behavior from dataset fit.
Which project repos need hosted demos, benchmark results, or just a clean README.

Open To

ML engineering and research engineering roles across data, models, evaluation, and production feedback.
Applied research work at labs, scientific organizations, or product teams with a concrete dataset, experiment, or system to ship.