A new San Francisco-based startup, Deep Cogito, has unveiled its first family of AI models, Cogito 1, which can switch between fast-response and deep-reasoning modes instead of being limited to just one approach.
These hybrid models combine the efficiency of standard AI models with the step-by-step problem-solving abilities seen in advanced systems like OpenAI's o1. Reasoning models excel in fields like maths and physics but often require more computing power, a trade-off Deep Cogito aims to balance.
The Cogito 1 series, built on Meta's Llama and Alibaba's Qwen models instead of starting from scratch, ranges from 3 billion to 70 billion parameters, with larger versions planned.
Early tests suggest the top-tier Cogito 70B outperforms rivals like DeepSeek's reasoning model and Meta's Llama 4 Scout in some tasks. The models are available for download or through cloud APIs, offering flexibility for developers.
Founded in June 2024 by ex-Google DeepMind product manager Dhruv Malhotra and former Google engineer Drishan Arora, Deep Cogito is backed by investors including South Park Commons.
The company's ambitious goal is to develop 'general superintelligence': AI that surpasses human capabilities rather than merely matching them. For now, the team says it has only scratched the surface of its scaling potential.