#Node.js

33 articles

LLM Prompt Caching: Cut API Costs 90%
AI/ML Engineering

Prompt caching cuts LLM API bills 50-90% by reusing the KV cache for stable prefixes. Anthropic, OpenAI, Gemini, and vLLM compared with real pricing, implementation patterns, and four workload simulations.

15 min read

Build Your First MCP Server in TypeScript
AI/ML Engineering

Step-by-step tutorial to build an MCP server in TypeScript with @modelcontextprotocol/sdk and Zod. Three tools, stdio transport, Inspector debugging, Claude Desktop/Cursor integration, and npm publish.

16 min read

Eval-Driven Development for LLM Apps: A Practical Workflow
AI/ML Engineering

TDD doesn't work on non-deterministic LLM outputs. Eval-driven development is the analog: paired prompt-expectation sets, rule-based or LLM-judge scoring, run as a regression suite on every change. Tools (promptfoo, Braintrust, OpenAI Evals), real customer-support example, and what evals catch that code review misses.

12 min read

Node.js Performance Tuning: Handle 10x More Requests
Architecture

Go from 800 to 15,000+ requests/second with clustering, Fastify, connection pooling, Redis caching, and event loop optimization. Real benchmarks included.

10 min read

The Self-Propagating npm Worm (April 2026): How postinstall Hooks Got Weaponized
Security

April 2026's self-propagating npm worm used postinstall hooks to scrape developer tokens (npm, GitHub, AWS), then auto-publish backdoored package versions. Detection steps, rotation playbook, and the structural defenses (ignore-scripts, pnpm, sandboxed CI, signed publishes).

12 min read

How to Scale a Next.js App to 100k Users Without Breaking the Bank
Architecture

A practical playbook for scaling Next.js apps to 100k users using ISR, multi-layer caching, CDN optimization, and backend tuning -- all for under $500/month.

9 min read

What Is MCP (Model Context Protocol)? A Developer's Guide
AI/ML Engineering

A practical guide to the Model Context Protocol (MCP) -- Anthropic's open standard for connecting AI models to external tools and data. Architecture, TypeScript server examples, and security best practices.

14 min read

The Vercel April 2026 Edge Function Incident: What Leaked, What to Do
Security

Vercel's April 6-9 edge function incident exposed env-vars and secrets in a subset of deployments. Timeline, scope, what was actually exposed, the rotation playbook for affected teams, and the structural defenses that prevent this next time.

11 min read

Serverless vs Containers: Choosing the Right Compute Model
Cloud

Compare serverless (Lambda, Cloud Run) and containers (ECS, EKS, Fargate) on cold starts, pricing, scaling, vendor lock-in, and local development. Learn when to use each compute model.

8 min read
