#Node.js

33 articles

LLM Prompt Caching: Cut API Costs 90%

Prompt caching cuts LLM API bills 50-90% by reusing the KV cache for stable prefixes. Anthropic, OpenAI, Gemini, and vLLM compared with real pricing, implementation patterns, and four workload simulations.

15 min read·Apr 5, 2026

AI/ML Engineering

Build Your First MCP Server in TypeScript

Step-by-step tutorial to build an MCP server in TypeScript with @modelcontextprotocol/sdk and Zod. Three tools, stdio transport, Inspector debugging, Claude Desktop/Cursor integration, and npm publish.

16 min read·Apr 5, 2026

AI/ML Engineering

Eval-Driven Development for LLM Apps: A Practical Workflow

TDD doesn't work on non-deterministic LLM outputs. Eval-driven development is the analog: paired prompt-expectation sets, rule-based or LLM-judge scoring, run as a regression suite on every change. Tools (promptfoo, Braintrust, OpenAI Evals), real customer-support example, and what evals catch that code review misses.

12 min read·Apr 5, 2026

Architecture

Node.js Performance Tuning: Handle 10x More Requests

Go from 800 to 15,000+ requests/second with clustering, Fastify, connection pooling, Redis caching, and event loop optimization. Real benchmarks included.

10 min read·Mar 25, 2026

Security

The Self-Propagating npm Worm (April 2026): How postinstall Hooks Got Weaponized

April 2026's self-propagating npm worm used postinstall hooks to scrape developer tokens (npm, GitHub, AWS), then auto-publish backdoored package versions. Detection steps, rotation playbook, and the structural defenses (ignore-scripts, pnpm, sandboxed CI, signed publishes).

12 min read·Mar 18, 2026

Architecture

How to Scale a Next.js App to 100k Users Without Breaking the Bank

A practical playbook for scaling Next.js apps to 100k users using ISR, multi-layer caching, CDN optimization, and backend tuning -- all for under $500/month.

9 min read·Mar 16, 2026

AI/ML Engineering

What Is MCP (Model Context Protocol)? A Developer's Guide

A practical guide to the Model Context Protocol (MCP) -- Anthropic's open standard for connecting AI models to external tools and data. Architecture, TypeScript server examples, and security best practices.

14 min read·Mar 15, 2026

Security

The Vercel April 2026 Edge Function Incident: What Leaked, What to Do

Vercel's April 6-9 edge function incident exposed environment variables and secrets in a subset of deployments. Timeline, scope, what was actually exposed, the rotation playbook for affected teams, and the structural defenses that prevent this next time.

11 min read·Mar 15, 2026

Cloud

Serverless vs Containers: Choosing the Right Compute Model

Compare serverless (Lambda, Cloud Run) and containers (ECS, EKS, Fargate) on cold starts, pricing, scaling, vendor lock-in, and local development. Learn when to use each compute model.

8 min read·Mar 5, 2026

