Super AGI Research Lab

Research lab dedicated to explore and pursue Generalized Super Intelligence

AI Employees Whitepaper

Neurosymbolic AI

Integrating symbolic reasoning with neural networks to achieve more abstract and human-like cognitive abilities.

Autonomous Agents & Multi-Agent Systems

Developing intelligent agents capable of independent decision-making and collaboration within complex environments.

Novel Model Architectures

Exploring innovative architectures and frameworks to enhance the capabilities and efficiency of AI systems.

System 2 Thinking

Investigating higher-order cognitive processes, such as reasoning, planning, and problem-solving, akin to human System 2 thinking.

Recursive Self-Improvement Systems

Designing AI systems capable of autonomously improving their own algorithms, architectures, and learning strategies over time.

Socio-Economic Research Areas

Digital Workforce

Examining the impact of AI and automation on employment dynamics, skill requirements, and the future of work in the digital age.

Algorithmic Governance

Studying the use of AI algorithms in decision-making processes within governance structures, and the associated implications for accountability, transparency, and fairness.

Universal Basic Income (UBI)

Investigating the potential role of UBI in mitigating the socio-economic effects of automation and AI-driven labor market shifts.

Ethical AI

Addressing ethical considerations and challenges in the development, deployment, and governance of advanced AI systems, with a focus on fairness, accountability, transparency, and societal impact.

Human-AI Collaboration

Exploring the dynamics of collaboration between humans and AI systems in various contexts, including work, education, healthcare, and creative endeavors.

Our Publications

GUIDE: Graphical User Interface Data for Execution

AUTONODE: A Neuro-Graphic Self-Learnable Engine for Cognitive GUI Automation

VEagle: Advancements in Multimodal Representation Learning

Recursive Agent Trajectory Fine-Tuning: Utilizing Agent Instructions for Enhanced Autonomy and Efficiency in AI Agents


  • Multi-Agent System

    All of us have heard about the Mixture-of-Experts (MoE) architecture for LLMs. MoE divides models into separate sub-networks (or “experts”), each specializing in a subset of the input data, to [...]

  • Meet Jake

    Meet Jake: The AI-Powered Market Research Agent

    Jake, crafted by SuperAGI, is an AI Market Research Agent designed to elevate the efficiency and accuracy of data analysis in market research. Distinct from standard data analysis tools, Jake [...]

  • Introducing DoRA : The Self Training Module of AutoNode

    Introduction In cognitive process automation, developing self-training modules is crucial. These modules can independently explore, learn, and adapt to complex and unfamiliar environments in the interface. They do this by [...]

  • A Deep Dive into Policy Optimization Algorithms & Frameworks for Model Alignment

    A Comprehensive Exploration of Policy Optimization Algorithms and Frameworks Introduction Reinforcement Learning (RL) is an intriguing area of machine learning that deals with the actions of intelligent agents within an [...]

  • Towards AGI Part 2: Multiverse of Actions

    Towards AGI: [Part 2] Multiverse of Actions

    In part-1 of Towards AGI series, we discussed a core component of Agents - Memory. However, the early agent architectures, didn’t have Memory as a first class primitive. As we [...]

  • Towards AGI: [Part 1] Agents with Memory

    Agents are an emerging class of artificial intelligence (AI) systems that use large language models (LLMs) to interact with the world. In the 'Towards AGI' series, we aim to explore [...]

Spotlight Papers

Research papers we are reading

  • ChatterBox: Multi-round Multimodal Referring and Grounding

  • KOSMOS-2: Grounding Multimodal Large Language Models to the World

  • CogVLM: Visual Expert for Pretrained Language Models

  • Contextual Object Detection with Multimodal Large Language Models

  • Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models

  • CogAgent: A Visual Language Model for GUI Agents