/blog/research

Research & Field Notes

Technical thinking, architecture notes, and operating lessons from production AI and applied systems work.

Start a project
6 posts
local ai vs cloud based
Research
May 27, 2026

Can Local AI Agents Do Real Office Work?

A practical benchmark comparing a local NVIDIA GB10 AI agent running Qwen3.6 35B through Hermes against GPT-5.5 on real office workflows, including code repair, document synthesis, web edits, travel research, and concurrency limits.

4090 vs GB10
Research
May 15, 2026

RTX 4090 vs NVIDIA GB10 for Local AI: Speed When It Fits, Headroom When It Doesn’t

A practical RTX 4090 vs NVIDIA GB10 local AI benchmark comparing Qwen3.6 35B Q4, Q5, MXFP4, long-context workloads, and dense 70B inference. The results show where the RTX 4090 dominates, where GB10’s memory headroom matters, and why the VRAM cliff is one of the most important limits in local AI hardware.

Image of GB10
Research
May 13, 2026

Qwen3.6-35B on NVIDIA GB10: 243 llama.cpp Runs to Find the Best Local Quant

A detailed benchmark of Qwen3.6-35B-A3B on NVIDIA GB10 using 243 llama.cpp runs, comparing BF16, Q8, Q5, Q4, and MXFP4 GGUF variants for local AI inference, RAG, agents, and long-context workloads.

Cover Image
Research
Apr 8, 2026

Gemma 4 on NVIDIA GB10: Quantization Benchmarks for Local Inference

A hands-on benchmark of Gemma 4 on NVIDIA GB10 comparing 31B dense and 26B-A4B MoE variants across speed, memory, thinking mode, and practical deployment tradeoffs.

Research
Research
Mar 10, 2026

Building Agentic Pipelines at Scale

A deep dive into how we architect autonomous AI agent systems for enterprise clients — from task decomposition to fault-tolerant execution.

Research
Research
Feb 20, 2026

AI Policy Frameworks Every Organization Needs

As AI adoption accelerates, organizations need governance frameworks that enable innovation while managing risk. Here's our approach.