Archives
- 30 Dec Modeling Voice AI Agents For Scale: A Kubernetes Approach
- 26 Dec Balancing Act: Using Structured Output with Streaming in LLMs
- 23 Dec Implementing Batched Queries with LLMs: Handling Large Datasets Efficiently
- 20 Dec Lessons from the Trenches: What I Learned Integrating LLMs in Production
- 10 Dec Taming Latency: Strategies to Handle Latencies in Scalable Systems
- 01 Oct Good coding practices enabling SOC2 Compliance
- 19 Sep Tests Over Types: Why Testing Trumps Type Hints in Python
- 02 Jul Resilient Systems: Managing Retries and Traffic Patterns
- 10 Jun Request Hedging: Fighting Tail Latencies
- 21 Apr The Power of Two Choices in Systems
- 16 Feb DORA Metrics: Simple Definitions and What They Measure
- 20 Jan Redis Locks and Fencing Tokens
- 19 Dec Redis Pipelining: The Powerful Feature You're Probably Not Using
- 15 Dec Do You Really Need Kubernetes? The Case for Simpler Infrastructure in Early-Stage Products
- 04 Oct Amdahl's Law & Little's Law: Foundational Principles for System Design
- 05 Apr Cuckoo Hashing: Elegant Collision Resolution
- 02 Oct Clean Database Migrations
- 15 Jun Beyond Booleans: Why Enums Should Be Your Go-To for State Management