research_statement

The End of Single-Threaded Intelligence

Scaling single-threaded model generation will end. We will hit context length limits and reach issues with context window pollution. The next frontier is multi-agent systems, under the guise of context management, multi-threaded models, or emergent coordination protocols.

Randy Ardywibowo

Jun 10, 2025 3 min read

Scaling Supervision with Intrinsic Rewards

“Intrinsic rewards are the self-supervised pretraining stage for reinforcement learning agents.” One of the most critical challenges in training AI systems today is finding scalable supervision signals. As models grow in complexity and their applications extend into increasingly nuanced domains, providing supervision to lead them towards performing well on a task becomes more difficult.

Randy Ardywibowo

Dec 11, 2024 3 min read