Category · Engineering
Field notes in Engineering.
186 articles in this category — part of the Jaypore Labs journal.
- 01 · Engineering · The AI productivity playbook: a real engineer's day · May 15, 2026 · 8 min read
- 02 · Engineering · Claude Code + PostHog: analytics-aware development · May 14, 2026 · 7 min read
- 03 · Engineering · Claude Code + Sentry: incident debugging as conversation · May 13, 2026 · 7 min read
- 04 · Engineering · Claude Code + Supabase: a working integration via MCP · May 12, 2026 · 7 min read
- 05 · Engineering · Effective MCP patterns: keeping AI tools safe at scale · May 11, 2026 · 8 min read
- 06 · Engineering · MCP fundamentals: connecting your AI tools to your team's stack · May 8, 2026 · 8 min read
- 07 · Engineering · Claude Code vs. Codex: which to reach for · May 7, 2026 · 7 min read
- 08 · Engineering · Getting started with Codex: install to first real task · May 6, 2026 · 7 min read
- 09 · Engineering · Getting started with Claude Code: install to first real task · May 5, 2026 · 8 min read
- 10 · Engineering · AI tools for software engineers: a practical orientation · May 4, 2026 · 7 min read
- 11 · Engineering · Determinism harnesses for non-deterministic systems · Apr 30, 2026 · 2 min read
- 12 · Engineering · Multi-agent orchestration: from kitchen brigade to opera · Apr 30, 2026 · 3 min read
- 13 · Engineering · Retry strategies that don't compound errors · Apr 30, 2026 · 3 min read
- 14 · Engineering · Tech lead: PR reviews deeper than 'lgtm' · Apr 30, 2026 · 4 min read
- 15 · Engineering · Your first MCP server (Node) · Apr 29, 2026 · 8 min read
- 16 · Engineering · MCP error handling: tell the model what went wrong · Apr 29, 2026 · 2 min read
- 17 · Engineering · Security: threat-model first draft from architecture · Apr 29, 2026 · 4 min read
- 18 · Engineering · What makes an eval good · Apr 29, 2026 · 7 min read
- 19 · Engineering · Data: pipeline DAG explainer + drift detector · Apr 28, 2026 · 5 min read
- 20 · Engineering · MCP for CI/CD: build-system tools as agent inputs · Apr 28, 2026 · 2 min read
- 21 · Engineering · Trend evals vs. threshold evals · Apr 28, 2026 · 2 min read
- 22 · Engineering · Backend: API design + endpoint scaffolding · Apr 27, 2026 · 9 min read
- 23 · Engineering · Data: SQL refactors and lineage maps · Apr 27, 2026 · 5 min read
- 24 · Engineering · Fall-back chains: cheap → expensive → human · Apr 27, 2026 · 3 min read
- 25 · Engineering · Integration tests for AI features: contract or behavioural? · Apr 27, 2026 · 3 min read
- 26 · Engineering · CI strategy: smoke vs. full suite for LLM apps · Apr 24, 2026 · 2 min read
- 27 · Engineering · Self-consistency: when N=3 beats a smarter prompt · Apr 24, 2026 · 3 min read
- 28 · Engineering · SRE: postmortem first drafts that don't blame · Apr 24, 2026 · 5 min read
- 29 · Engineering · Tech writer: doc audits that catch what humans miss · Apr 24, 2026 · 4 min read
- 30 · Engineering · Cost guardrails: stop runaway agents before billing does · Apr 23, 2026 · 6 min read
- 31 · Engineering · End-to-end tests for AI workflows: scope and survival · Apr 23, 2026 · 2 min read
- 32 · Engineering · MCP for actioning tools (PR creator, ticket closer) · Apr 23, 2026 · 2 min read
- 33 · Engineering · Frontend: accessibility passes that finally get done · Apr 22, 2026 · 4 min read
- 34 · Engineering · MCP and the Claude Code workflow specifically · Apr 22, 2026 · 2 min read
- 35 · Engineering · Pairwise judges: A/B agreement at scale · Apr 22, 2026 · 2 min read
- 36 · Engineering · Pinning model versions through provider migrations · Apr 22, 2026 · 2 min read
- 37 · Engineering · Drift catchers: detecting style shifts · Apr 21, 2026 · 2 min read
- 38 · Engineering · Eval CI: the pass/fail gate that's actually useful · Apr 21, 2026 · 2 min read
- 39 · Engineering · Prompt invariance: prompts that survive paraphrase · Apr 21, 2026 · 3 min read
- 40 · Engineering · Tool failure modes: timeouts, retries, idempotency · Apr 21, 2026 · 4 min read
- 41 · Engineering · Context engineering: what to load, what to defer · Apr 20, 2026 · 4 min read
- 42 · Engineering · Output validation: pydantic, zod, and friends in production · Apr 20, 2026 · 2 min read
- 43 · Engineering · Versioning model + prompt as a unit · Apr 20, 2026 · 3 min read
- 44 · Engineering · Backend: database migrations without fear · Apr 17, 2026 · 5 min read
- 45 · Engineering · ML: feature-store query rewrites · Apr 17, 2026 · 4 min read
- 46 · Engineering · Building agents that explain themselves · Apr 16, 2026 · 3 min read
- 47 · Engineering · Constrained decoding: the underrated lever · Apr 16, 2026 · 3 min read
- 48 · Engineering · Mobile (Android): Compose rollout audits · Apr 16, 2026 · 4 min read
- 49 · Engineering · Safety guardrails: refusal patterns that don't make agents useless · Apr 16, 2026 · 3 min read
- 50 · Engineering · Confidence calibration: when 'I don't know' is the answer · Apr 15, 2026 · 3 min read
- 51 · Engineering · Counter-example mining · Apr 15, 2026 · 3 min read
- 52 · Engineering · The post-launch test plan: what runs forever · Apr 15, 2026 · 3 min read
- 53 · Engineering · SRE: runbook generation that captures the response · Apr 15, 2026 · 5 min read
- 54 · Engineering · LLM evals are restaurant health inspections · Apr 14, 2026 · 4 min read
- 55 · Engineering · Retiring an agent · Apr 14, 2026 · 3 min read
- 56 · Engineering · Long-horizon tasks: keeping an agent on rails for hours · Apr 13, 2026 · 4 min read
- 57 · Engineering · MCP authorization: per-user permissions · Apr 13, 2026 · 2 min read
- 58 · Engineering · MCP composition: when one server should call another · Apr 13, 2026 · 2 min read
- 59 · Engineering · MCP server versioning: shipping breaking changes safely · Apr 13, 2026 · 2 min read
- 60 · Engineering · MCP transport: stdio vs. HTTP vs. SSE · Apr 13, 2026 · 2 min read
- 61 · Engineering · Deploying agents in CI: scoped, audited, repeatable · Apr 10, 2026 · 7 min read
- 62 · Engineering · Caching deterministic prefixes · Apr 10, 2026 · 3 min read
- 63 · Engineering · Eval result storage and versioning · Apr 10, 2026 · 2 min read
- 64 · Engineering · Tests for retrieval pipelines · Apr 10, 2026 · 2 min read
- 65 · Engineering · Beyond MCP: tool-use specs in major models · Apr 9, 2026 · 2 min read
- 66 · Engineering · Cost tests: catching the prompt that doubled spend · Apr 9, 2026 · 2 min read
- 67 · Engineering · The judge pattern for confidence · Apr 9, 2026 · 3 min read
- 68 · Engineering · MCP in 10 minutes · Apr 9, 2026 · 6 min read
- 69 · Engineering · QA: test-plan generation from acceptance criteria · Apr 9, 2026 · 5 min read
- 70 · Engineering · Versioning agent behaviour: prompts as source code · Apr 8, 2026 · 3 min read
- 71 · Engineering · UX tests for AI-generated content · Apr 8, 2026 · 2 min read
- 72 · Engineering · Agent observability: traces that tell you what happened · Apr 7, 2026 · 6 min read
- 73 · Engineering · Eval anti-patterns: when evals make products worse · Apr 7, 2026 · 3 min read
- 74 · Engineering · Browsing agents: scraping vs. structured tools · Apr 6, 2026 · 3 min read
- 75 · Engineering · Eval-driven prompt iteration · Apr 6, 2026 · 2 min read
- 76 · Engineering · Tool-use evals: right tool, right order · Apr 6, 2026 · 2 min read
- 77 · Engineering · Voice-first agents: the latency budget you live within · Apr 6, 2026 · 3 min read
- 78 · Engineering · Agent memory: what to write down, what to forget · Apr 3, 2026 · 3 min read
- 79 · Engineering · Hallucination checks: cite-or-it-didn't-happen · Apr 3, 2026 · 3 min read
- 80 · Engineering · MCP server observability · Apr 3, 2026 · 2 min read
- 81 · Engineering · Prompt evolution: how agents get worse without you noticing · Apr 3, 2026 · 3 min read
- 82 · Engineering · Red-teaming your own prompt · Apr 3, 2026 · 3 min read
- 83 · Engineering · EM: 1:1 prep + roadmap sanity check · Apr 2, 2026 · 4 min read
- 84 · Engineering · Frontend: component scaffolding + state machines · Apr 2, 2026 · 4 min read
- 85 · Engineering · Full-stack: a real feature in an afternoon · Apr 2, 2026 · 5 min read
- 86 · Engineering · Tests for tool-using agents: trace assertions · Apr 2, 2026 · 3 min read
- 87 · Engineering · MCP authentication: tokens, scopes, OAuth · Apr 1, 2026 · 2 min read
- 88 · Engineering · MCP server rate limits: the polite-rejection pattern · Apr 1, 2026 · 2 min read
- 89 · Engineering · Property-based testing for LLM features · Apr 1, 2026 · 2 min read
- 90 · Engineering · Building your first eval set from scratch · Mar 31, 2026 · 8 min read
- 91 · Engineering · Evals for agents: trajectory + outcome · Mar 31, 2026 · 7 min read
- 92 · Engineering · MCP and secrets management · Mar 31, 2026 · 2 min read
- 93 · Engineering · MCP server hosting: local, sidecar, remote · Mar 31, 2026 · 2 min read
- 94 · Engineering · MCP tool naming: making tools discoverable · Mar 31, 2026 · 2 min read
- 95 · Engineering · LLM-as-judge: when to trust it, when not · Mar 30, 2026 · 7 min read
- 96 · Engineering · MCP for data tools (Postgres, BigQuery, S3) · Mar 30, 2026 · 2 min read
- 97 · Engineering · Structured output: JSON mode, schemas, why one beats the other · Mar 30, 2026 · 7 min read
- 98 · Engineering · Idempotency keys for LLM calls · Mar 27, 2026 · 3 min read
- 99 · Engineering · OSS maintainer: triage + contributor-guide updates · Mar 27, 2026 · 4 min read
- 100 · Engineering · Prompts are recipes, not spells · Mar 27, 2026 · 4 min read
- 101 · Engineering · Why we need MCP at all · Mar 27, 2026 · 2 min read
- 102 · Engineering · Human eval workflows: instructions that don't vary · Mar 26, 2026 · 2 min read
- 103 · Engineering · Judging open-ended output without a rubric · Mar 26, 2026 · 2 min read
- 104 · Engineering · MCP tool schemas: arg shapes that help · Mar 26, 2026 · 2 min read
- 105 · Engineering · Regression cohorts: catching what evals miss · Mar 26, 2026 · 3 min read
- 106 · Engineering · Code-writing agents: the test-first discipline · Mar 25, 2026 · 3 min read
- 107 · Engineering · Drift tests vs. functional tests: separate lanes · Mar 25, 2026 · 3 min read
- 108 · Engineering · Plan vs. act: the agent loop everyone gets wrong · Mar 25, 2026 · 6 min read
- 109 · Engineering · Privacy tests: PII redaction assertions · Mar 24, 2026 · 2 min read
- 110 · Engineering · Sub-agents: when 1+1 actually equals 2 · Mar 24, 2026 · 4 min read
- 111 · Engineering · Calibrating your judge: meta-evals · Mar 23, 2026 · 2 min read
- 112 · Engineering · Security: code-pattern audits and CVE sweeps · Mar 23, 2026 · 4 min read
- 113 · Engineering · Tool design: write tools the way you write APIs · Mar 23, 2026 · 8 min read
- 114 · Engineering · Golden-set discipline · Mar 20, 2026 · 3 min read
- 115 · Engineering · Why probabilistic systems still need deterministic contracts · Mar 20, 2026 · 7 min read
- 116 · Engineering · Refusal grammars: predictable, not surprising · Mar 20, 2026 · 3 min read
- 117 · Engineering · MCP for internal tools (Linear, Notion, Slack analogues) · Mar 19, 2026 · 2 min read
- 118 · Engineering · ML: eval harness from a spec · Mar 19, 2026 · 4 min read
- 119 · Engineering · Multimodal agents: when adding vision actually helps · Mar 19, 2026 · 4 min read
- 120 · Engineering · Test-data management for AI: synthetic vs. real · Mar 19, 2026 · 2 min read
- 121 · Engineering · Behavioural assertions: testing 'should-ness' · Mar 18, 2026 · 2 min read
- 122 · Engineering · Eval taxonomy: golden, behavioural, drift, safety · Mar 18, 2026 · 3 min read
- 123 · Engineering · Evals for retrieval: separating retrieval from synthesis · Mar 18, 2026 · 2 min read
- 124 · Engineering · Your first MCP server (Python) · Mar 18, 2026 · 2 min read
- 125 · Engineering · Agent A/B tests: comparing without confusing your users · Mar 17, 2026 · 3 min read
- 126 · Engineering · The deterministic-envelope pattern · Mar 17, 2026 · 3 min read
- 127 · Engineering · MCP and prompt injection: ambient instructions · Mar 17, 2026 · 2 min read
- 128 · Engineering · Few-shot drift: why golden examples poison new versions · Mar 16, 2026 · 3 min read
- 129 · Engineering · The judge pattern: agents that grade other agents · Mar 16, 2026 · 4 min read
- 130 · Engineering · PII in test fixtures: the boring legal slope · Mar 16, 2026 · 3 min read
- 131 · Engineering · Architect: vendor-comparison architecture doc · Mar 13, 2026 · 3 min read
- 132 · Engineering · A senior engineer's day with Claude Code · Mar 13, 2026 · 9 min read
- 133 · Engineering · Skills files: recipes the model can call · Mar 13, 2026 · 4 min read
- 134 · Engineering · Evals that survive a model bump · Mar 12, 2026 · 3 min read
- 135 · Engineering · Managed agents: when to reach for them · Mar 12, 2026 · 4 min read
- 136 · Engineering · Mock LLMs in tests: when to fake, when to call · Mar 12, 2026 · 3 min read
- 137 · Engineering · The red set: adversarial cases you're allowed to fail · Mar 12, 2026 · 2 min read
- 138 · Engineering · The new test pyramid for AI products · Mar 11, 2026 · 7 min read
- 139 · Engineering · Per-feature evals vs. per-model evals · Mar 11, 2026 · 2 min read
- 140 · Engineering · Sampling production traffic for eval · Mar 11, 2026 · 2 min read
- 141 · Engineering · Security tests: prompt-injection regression suite · Mar 10, 2026 · 2 min read
- 142 · Engineering · Temperature, top-p, and the production tradeoff · Mar 10, 2026 · 3 min read
- 143 · Engineering · QA: flaky test triage at scale · Mar 9, 2026 · 5 min read
- 144 · Engineering · DevOps: CI pipeline diagnosis at 2am · Mar 6, 2026 · 4 min read
- 145 · Engineering · DevOps: Terraform refactor with a watchful copilot · Mar 6, 2026 · 5 min read
- 146 · Engineering · The future of MCP · Mar 6, 2026 · 2 min read
- 147 · Engineering · MCP testing: harnesses, fixtures, regressions · Mar 6, 2026 · 2 min read
- 148 · Engineering · Output post-processors that don't hide the truth · Mar 6, 2026 · 3 min read
- 149 · Engineering · Authoring eval cases · Mar 5, 2026 · 2 min read
- 150 · Engineering · Snapshot tests: where they help, where they trap · Mar 5, 2026 · 2 min read
- 151 · Engineering · Tests for streaming responses · Mar 5, 2026 · 2 min read
- 152 · Engineering · Agent rollback: kill switches on day one · Mar 4, 2026 · 3 min read
- 153 · Engineering · Determinism for tool calls: keys, ordering, side-effects · Mar 4, 2026 · 2 min read
- 154 · Engineering · Output diffing in CI · Mar 4, 2026 · 3 min read
- 155 · Engineering · Reading an eval dashboard · Mar 4, 2026 · 2 min read
- 156 · Engineering · Accessibility tests for AI surfaces · Mar 3, 2026 · 2 min read
- 157 · Engineering · Eval-driven development · Mar 3, 2026 · 3 min read
- 158 · Engineering · Eval ownership in an org: PM, eng, or QA? · Mar 3, 2026 · 2 min read
- 159 · Engineering · Performance tests: token budgets and latency SLAs · Mar 3, 2026 · 2 min read
- 160 · Engineering · Auto-generated eval cases from production logs · Mar 2, 2026 · 2 min read
- 161 · Engineering · Eval cost management · Mar 2, 2026 · 2 min read
- 162 · Engineering · Mobile (iOS): UIKit-to-SwiftUI translation · Mar 2, 2026 · 4 min read
- 163 · Engineering · AI-native debugging: the rubber duck got smarter · Feb 26, 2026 · 4 min read
- 164 · Engineering · Claude Code + Jira: standups without the standing · Feb 25, 2026 · 3 min read
- 165 · Engineering · Multi-model routing: the dispatcher pattern for LLMs · Feb 20, 2026 · 4 min read
- 166 · Engineering · Claude Code + Linear: where work lives, the agent lives · Feb 19, 2026 · 3 min read
- 167 · Engineering · Semantic caching: why your top 1% of queries cost 60% of your bill · Feb 17, 2026 · 4 min read
- 168 · Engineering · Claude Code + Notion: docs become structured data · Feb 16, 2026 · 4 min read
- 169 · Engineering · AI cost attribution: a chargeback model for LLM spend · Feb 12, 2026 · 4 min read
- 170 · Engineering · Claude Code + Slack: standups, escalations, and the back-channel · Feb 11, 2026 · 3 min read
- 171 · Engineering · AI latency budgets: borrowing from network engineering · Feb 9, 2026 · 4 min read
- 172 · Engineering · AI feature flags: a model rollout looks like a deployment · Feb 5, 2026 · 4 min read
- 173 · Engineering · Claude Code + Datadog: 2 a.m. is for the agent now · Feb 4, 2026 · 4 min read
- 174 · Engineering · AI canary deployments: 1% traffic, 100% paranoia · Feb 2, 2026 · 4 min read
- 175 · Engineering · Embedding model selection: the 5-minute decision tree · Jan 29, 2026 · 4 min read
- 176 · Engineering · Claude Code + Stripe: revenue-aware development · Jan 28, 2026 · 4 min read
- 177 · Engineering · Vector DB architecture: pgvector, managed, or homemade · Jan 26, 2026 · 4 min read
- 178 · Engineering · RAG vs. fine-tuning: a 90% decision tree · Jan 22, 2026 · 4 min read
- 179 · Engineering · Claude Code + Figma: design handoff in one prompt · Jan 21, 2026 · 4 min read
- 180 · Engineering · Token economics: what your unit cost actually is · Jan 19, 2026 · 4 min read
- 181 · Engineering · AI incident response: the postmortem template you'll wish you had · Jan 15, 2026 · 4 min read
- 182 · Engineering · An AI-aware pull request template · Jan 8, 2026 · 5 min read
- 183 · Engineering · Self-healing pipelines: the night shift you don't have to pay · Jan 5, 2026 · 4 min read
- 184 · Engineering · Agent supervision loops: the OODA loop, re-implemented · Jan 2, 2026 · 4 min read
- 185 · Engineering · EU AI Act: what changes in your engineering process · Dec 30, 2025 · 4 min read
- 186 · Engineering · HIPAA and AI: the BAA is the first conversation · Dec 26, 2025 · 4 min read