OpenSSM – “Small Specialist Models” for Industrial AI

  See full documentation at aitomatic.github.io/openssm/.  

OpenSSM (pronounced open-ess-ess-em) is an open-source framework for Small Specialist Models (SSMs), which are key to enhancing trust, reliability, and safety in Industrial-AI applications. Harnessing the power of domain expertise, SSMs operate either alone or in "teams". They collaborate with other SSMs, planners, and sensors/actuators to deliver real-world problem-solving capabilities.

Unlike Large Language Models (LLMs), which are computationally intensive and generalized, SSMs are lean, efficient, and designed specifically for individual domains. This focus makes them an optimal choice for businesses, SMEs, researchers, and developers seeking specialized and robust AI solutions for industrial applications.

SSM in Industrial AI

A prime deployment scenario for SSMs is within the aiCALM (Collaborative Augmented Large Models) architecture. aiCALM represents a cohesive assembly of AI components tailored for sophisticated problem-solving capabilities. Within this framework, SSMs work with General Management Models (GMMs) and other components to solve complex, domain-specific, and industrial problems.

Why SSM?

The trend towards specialization in AI models is a clear trajectory seen by many in the field.

  Specialization is crucial for quality .. not general purpose Al models – Eric Schmidt, Schmidt Foundation  

  .. small models .. for a specific task that are good – Matei Zaharia, Databricks  

  .. small agents working together .. specific and best in their tasks – Harrison Chase, Langchain  

  .. small but highly capable expert models – Andrej Karpathy, OpenAI  

  .. small models are .. a massive paradigm shift .. about deploying AI models at scale – Rob Toews, Radical Ventures  

As predicted by Eric Schmidt and others, we will see “a rich ecosystem to emerge [of] high-value, specialized AI systems.” SSMs are the central part in the architecture of these systems.

What OpenSSM Offers

OpenSSM fills this gap directly, with the following benefits to the community, developers, and businesses:

Target Audience

Our primary audience includes:

SSM Architecture

At a high level, SSMs comprise a front-end Small Language Model (SLM), an adapter layer in the middle, and a wide range of back-end domain-knowledge sources. The SLM itself is a small, efficient, language model, which may be domain-specific or not, and may have been distilled from a larger model. Thus, domain knowledge may come from either, or both, the SLM and the backends.

High-Level SSM Architecture

The above diagram illustrates the high-level architecture of an SSM, which comprises three main components:

  1. Small Language Model (SLM): This forms the communication frontend of an SSM.

  2. Adapters (e.g., LlamaIndex): These provide the interface between the SLM and the domain-knowledge backends.

  3. Domain-Knowledge Backends: These include text files, documents, PDFs, databases, code, knowledge graphs, models, other SSMs, etc.

SSMs communicate in both unstructured (natural language) and structured APIs, catering to a variety of real-world industrial systems.

SSM Composability

The composable nature of SSMs allows for easy combination of domain-knowledge sources from multiple models.

Getting Started

See our Getting Started Guide for more information.

Roadmap

Community

Join our vibrant community of AI enthusiasts, researchers, developers, and businesses who are democratizing industrial AI through SSMs. Participate in the discussions, share your ideas, or ask for help on our Community Discussions.

Contribute

OpenSSM is a community-driven initiative, and we warmly welcome contributions. Whether it's enhancing existing models, creating new SSMs for different industrial domains, or improving our documentation, every contribution counts. See our Contribution Guide for more details.

License

OpenSSM is released under the Apache 2.0 License.