Hi! I'm Josh, a full-stack and product developer building across financial, design, and product systems.

  • Shipping across product, engineering, and design.
  • Focused on useful systems over decorative complexity.
  • Most interested in interfaces that feel clear and alive.

Work

LLM QA

Annotated and evaluated NLP datasets for LLM training, benchmarking, and quality assurance.

Start a conversation

Cohere

2025 - 2026

Sep 2025 - Jan 2026

ML Data Annotation (Contract)

Dataset evaluation and QA work supporting production LLM training.

Contract work focused on dataset quality and evaluation for production language model workflows.

Dataset annotationBenchmarkingQA

Detail

Scope

Annotated and evaluated NLP datasets used in model training, benchmarking, and QA processes.

Outcomes

Supported production model training and benchmarking workflows.
Contributed evaluation and QA work across NLP datasets.

Stack

NLP datasetsEvaluation workflows
Back to selected workDiscuss a similar project