Skip to main content
Redhat Developers  Logo
  • Products

    Featured

    • Red Hat Enterprise Linux
      Red Hat Enterprise Linux Icon
    • Red Hat OpenShift AI
      Red Hat OpenShift AI
    • Red Hat Enterprise Linux AI
      Linux icon inside of a brain
    • Image mode for Red Hat Enterprise Linux
      RHEL image mode
    • Red Hat OpenShift
      Openshift icon
    • Red Hat Ansible Automation Platform
      Ansible icon
    • Red Hat Developer Hub
      Developer Hub
    • View All Red Hat Products
    • Linux

      • Red Hat Enterprise Linux
      • Image mode for Red Hat Enterprise Linux
      • Red Hat Universal Base Images (UBI)
    • Java runtimes & frameworks

      • JBoss Enterprise Application Platform
      • Red Hat build of OpenJDK
    • Kubernetes

      • Red Hat OpenShift
      • Microsoft Azure Red Hat OpenShift
      • Red Hat OpenShift Virtualization
      • Red Hat OpenShift Lightspeed
    • Integration & App Connectivity

      • Red Hat Build of Apache Camel
      • Red Hat Service Interconnect
      • Red Hat Connectivity Link
    • AI/ML

      • Red Hat OpenShift AI
      • Red Hat Enterprise Linux AI
    • Automation

      • Red Hat Ansible Automation Platform
      • Red Hat Ansible Lightspeed
    • Developer tools

      • Red Hat Trusted Software Supply Chain
      • Podman Desktop
      • Red Hat OpenShift Dev Spaces
    • Developer Sandbox

      Developer Sandbox
      Try Red Hat products and technologies without setup or configuration fees for 30 days with this shared Openshift and Kubernetes cluster.
    • Try at no cost
  • Technologies

    Featured

    • AI/ML
      AI/ML Icon
    • Linux
      Linux Icon
    • Kubernetes
      Cloud icon
    • Automation
      Automation Icon showing arrows moving in a circle around a gear
    • View All Technologies
    • Programming Languages & Frameworks

      • Java
      • Python
      • JavaScript
    • System Design & Architecture

      • Red Hat architecture and design patterns
      • Microservices
      • Event-Driven Architecture
      • Databases
    • Developer Productivity

      • Developer productivity
      • Developer Tools
      • GitOps
    • Secure Development & Architectures

      • Security
      • Secure coding
    • Platform Engineering

      • DevOps
      • DevSecOps
      • Ansible automation for applications and services
    • Automated Data Processing

      • AI/ML
      • Data Science
      • Apache Kafka on Kubernetes
      • View All Technologies
    • Start exploring in the Developer Sandbox for free

      sandbox graphic
      Try Red Hat's products and technologies without setup or configuration.
    • Try at no cost
  • Learn

    Featured

    • Kubernetes & Cloud Native
      Openshift icon
    • Linux
      Rhel icon
    • Automation
      Ansible cloud icon
    • Java
      Java icon
    • AI/ML
      AI/ML Icon
    • View All Learning Resources

    E-Books

    • GitOps Cookbook
    • Podman in Action
    • Kubernetes Operators
    • The Path to GitOps
    • View All E-books

    Cheat Sheets

    • Linux Commands
    • Bash Commands
    • Git
    • systemd Commands
    • View All Cheat Sheets

    Documentation

    • API Catalog
    • Product Documentation
    • Legacy Documentation
    • Red Hat Learning

      Learning image
      Boost your technical skills to expert-level with the help of interactive lessons offered by various Red Hat Learning programs.
    • Explore Red Hat Learning
  • Developer Sandbox

    Developer Sandbox

    • Access Red Hat’s products and technologies without setup or configuration, and start developing quicker than ever before with our new, no-cost sandbox environments.
    • Explore Developer Sandbox

    Featured Developer Sandbox activities

    • Get started with your Developer Sandbox
    • OpenShift virtualization and application modernization using the Developer Sandbox
    • Explore all Developer Sandbox activities

    Ready to start developing apps?

    • Try at no cost
  • Blog
  • Events
  • Videos

Why Models-as-a-Service architecture is ideal for AI models

June 30, 2025
Ritesh Shah Juliano Mohr Ishu Verma Karl Eklund Guillaume Moutier Erwan Granger
Related topics:
Artificial intelligenceGitOpsSecuritySummit 2025
Related products:
Red Hat 3scale API ManagementRed Hat AIRed Hat OpenShift AIRed Hat OpenShift GitOpsRed Hat OpenShiftRed Hat Single sign-on

Share:

    The Models-as-a-Service (MaaS) platform leverages Red Hat OpenShift AI, Red Hat 3scale API Management, and Red Hat Single Sign-On to create a secure and scalable environment for AI models. OpenShift AI provides the foundational platform for the AI/ML lifecycle, 3Scale manages API access and security, and Red Hat SSO ensures centralized authentication and authorization. The vLLM powers model execution with its efficiency and speed. The architecture supports AI governance, zero-trust access, and hybrid cloud flexibility, creating a cohesive and high-performing ecosystem for deploying and managing AI models effectively.

    Follow this series:

    1. Part 1, 6 benefits of Models-as-a-Service for enterprises, is an introduction to MaaS for enterprises.
    2. This article explores broad architectural details and why enterprises need MaaS.
    3. Part 3 explains how to implement MaaS in an enterprise and its various components.
    4. Part 4 discusses inference optimization, scalability, and security aspects for large model deployments.

    The MaaS architecture solution

    Building a scalable and efficient MaaS platform demands a thoughtfully constructed and resilient architecture that integrates a diverse array of critical components. This architecture is not about throwing technology together. It’s about creating a cohesive and high-performing ecosystem.

    The core of this robust MaaS solution stack consists of a combination of leading-edge technologies. OpenShift AI serves as the foundation, providing a comprehensive platform for the entire AI/ML lifecycle. It handles everything from model training and development to deployment and monitoring. Complementing this is the 3Scale API Gateway, a crucial component of Red Hat 3scale API Management. The API Gateway is essential for managing access, controlling traffic, and ensuring the security of the AI models as they are exposed as services. Red Hat SSO further bolsters security by providing centralized authentication and authorization, enabling simplified and secure user access to the platform.

    Powering the actual model execution is an AI inference server, specifically vLLM. The vLLM is known for its speed and efficiency, making it an excellent choice for handling the demands of real-time AI inferencing. This combination offers a complete and end-to-end solution. It brings together AI governance, ensuring that models are used ethically. 

    Zero-trust access is also a key point. Every user and device is rigorously authenticated and authorized before gaining access. Hybrid cloud flexibility is another benefit, allowing the platform to operate seamlessly across different environments, whether it's on-premises, in the cloud, or a mix of both. All of this is delivered on a single, unified platform with consistent tooling, which greatly simplifies management and operations.

    For a clear visual representation of the Model-as-a-Service solution, refer to the high-level architecture diagram Figure 1. This diagram provides an overview of the MaaS solution, illustrating how each component fits together and interacts to create a functional and scalable platform. 

    MaaS Architecture
    Figure 1: This diagram shows the MaaS architecture.

    It is important to note that Red Hat OpenShift, acting as a robust and consistent Kubernetes layer, forms the de facto platform for deploying MaaS across diverse environments (public clouds, private data centers, or at the edge). This provides the flexibility needed in modern hybrid cloud strategies. 

    Hybrid cloud advantages

    The MaaS architecture's reliance on a hybrid cloud model unlocks a suite of benefits for enterprises. The following advantages offer operational efficiency, security, and cost optimization:

    • Unparalleled freedom of choice and portability: The MaaS platform empowers businesses with the freedom to deploy models wherever needed. This eliminates concerns about vendor lock-in and ensures that AI workloads are fully portable and adaptable to changing business requirements.
    • Fortified security and policy consistency: By leveraging OpenShift as the consistent Kubernetes foundation alongside Red Hat SSO, the MaaS solution guarantees uniform security policies across all hybrid AI deployments. This ensures secure access to large language models (LLMs) regardless of their location and maintains consistent policies across all environments, providing a robust and unified security posture.
    • Optimized costs and enhanced resource utilization: The MaaS approach focuses on reducing costs by centralizing model inference services and avoiding the costly duplication of resources. This model allows enterprises to offer open source models and the necessary AI technology stack as a shared resource accessible across the entire organization. Self-hosting addresses critical data privacy concerns associated with relying on third-party models, which can incur substantial expenses when deployed at scale.
    • Strengthened data privacy and security: Organizations can uphold compliance with existing security, data, and privacy regulations by avoiding the use of third-party hosted models, which might inadvertently expose sensitive enterprise data to external entities.
    • Scalable and granular access management: Features, such as SSO for all internal AI portals and advanced session management for regulatory compliance, facilitate scalable and fine-grained access management across distributed hybrid environments. The architecture supports multi-tenancy, logically isolating environments through shared resources, which allows for the efficient management of multiple tenants, tenant administrators, and user access to APIs and administrative portals.
    • Streamlined operations and governance: IT departments gain the ability to consistently manage APIs across both cloud and on-premises environments through OpenShift integration and deploy dedicated API gateways for private LLM instances. This results in enhanced AI management and robust oversight and governance with features such as versioning and regression testing, leading to a more controlled and reliable deployment process.
    • Accelerated innovation cycles: The synergy of 3Scale, Keycloak, and OpenShift AI fosters accelerated innovation through managed access and APIs. Automating the 3Scale configuration via its operator, significantly streamlines the process of deploying and exposing new models, resulting in a quicker time to market and rapid deployment of innovative AI solutions.

    The integration of these components enables enterprises to develop a highly scalable and manageable MaaS platform, empowering developers to seamlessly integrate AI capabilities into their applications across a broad spectrum of diverse and distributed infrastructures. This facilitates a more agile, secure, and cost-effective approach to AI deployment and utilization.

    Next up

    You can refer to the detailed MaaS architecture. You may also want to review this arcade video presentation. In part 3 of this series, we'll discuss the various components of MaaS and explore how to implement it for an enterprise.

     

    Related Posts

    • How to size your projects for Red Hat's single sign-on technology

    • Model training in Red Hat OpenShift AI

    • Packaging APIs for consumers with Red Hat 3scale API Management

    • Red Hat OpenShift AI and machine learning operations

    Recent Posts

    • Storage considerations for OpenShift Virtualization

    • Upgrade from OpenShift Service Mesh 2.6 to 3.0 with Kiali

    • EE Builder with Ansible Automation Platform on OpenShift

    • How to debug confidential containers securely

    • Announcing self-service access to Red Hat Enterprise Linux for Business Developers

    What’s up next?

    Learn how to use Red Hat OpenShift AI to quickly develop, train, and deploy machine learning models. This hands-on guide walks you through setting up a Jupyter notebook environment and running sample code in a JupyterLab Integrated Development Environment (IDE) in the Developer Sandbox.

    Start the activity
    Red Hat Developers logo LinkedIn YouTube Twitter Facebook

    Products

    • Red Hat Enterprise Linux
    • Red Hat OpenShift
    • Red Hat Ansible Automation Platform

    Build

    • Developer Sandbox
    • Developer Tools
    • Interactive Tutorials
    • API Catalog

    Quicklinks

    • Learning Resources
    • E-books
    • Cheat Sheets
    • Blog
    • Events
    • Newsletter

    Communicate

    • About us
    • Contact sales
    • Find a partner
    • Report a website issue
    • Site Status Dashboard
    • Report a security problem

    RED HAT DEVELOPER

    Build here. Go anywhere.

    We serve the builders. The problem solvers who create careers with code.

    Join us if you’re a developer, software engineer, web designer, front-end designer, UX designer, computer scientist, architect, tester, product manager, project manager or team lead.

    Sign me up

    Red Hat legal and privacy links

    • About Red Hat
    • Jobs
    • Events
    • Locations
    • Contact Red Hat
    • Red Hat Blog
    • Inclusion at Red Hat
    • Cool Stuff Store
    • Red Hat Summit
    © 2025 Red Hat

    Red Hat legal and privacy links

    • Privacy statement
    • Terms of use
    • All policies and guidelines
    • Digital accessibility

    Report a website issue