Tagged · Testing for AI
Field notes,
Testing for AI.
25 articles in this tag — part of the Jaypore Labs journal.
- 01Engineering
Determinism harnesses for non-deterministic systems
Apr 30, 20262 min read - 02Engineering
Integration tests for AI features: contract or behavioural?
Apr 27, 20263 min read - 03Engineering
CI strategy: smoke vs. full suite for LLM apps
Apr 24, 20262 min read - 04Engineering
End-to-end tests for AI workflows: scope and survival
Apr 23, 20262 min read - 05Engineering
The post-launch test plan: what runs forever
Apr 15, 20263 min read - 06Engineering
Tests for retrieval pipelines
Apr 10, 20262 min read - 07Engineering
Cost tests: catching the prompt that doubled spend
Apr 9, 20262 min read - 08Engineering
UX tests for AI-generated content
Apr 8, 20262 min read - 09Engineering
Tests for tool-using agents: trace assertions
Apr 2, 20263 min read - 10Engineering
Property-based testing for LLM features
Apr 1, 20262 min read - 11Engineering
Regression cohorts: catching what evals miss
Mar 26, 20263 min read - 12Engineering
Drift tests vs. functional tests: separate lanes
Mar 25, 20263 min read - 13Engineering
Privacy tests: PII redaction assertions
Mar 24, 20262 min read - 14Engineering
Golden-set discipline
Mar 20, 20263 min read - 15Engineering
Test-data management for AI: synthetic vs. real
Mar 19, 20262 min read - 16Engineering
Behavioural assertions: testing 'should-ness'
Mar 18, 20262 min read - 17Engineering
PII in test fixtures: the boring legal slope
Mar 16, 20263 min read - 18Engineering
Mock LLMs in tests: when to fake, when to call
Mar 12, 20263 min read - 19Engineering
The new test pyramid for AI products
Mar 11, 20267 min read - 20Engineering
Security tests: prompt-injection regression suite
Mar 10, 20262 min read - 21Engineering
Snapshot tests: where they help, where they trap
Mar 5, 20262 min read - 22Engineering
Tests for streaming responses
Mar 5, 20262 min read - 23Engineering
Accessibility tests for AI surfaces
Mar 3, 20262 min read - 24Engineering
Eval-driven development
Mar 3, 20263 min read - 25Engineering
Performance tests: token budgets and latency SLAs
Mar 3, 20262 min read