GTEdgeAI/gt-ai-os-community

Fork 0

Go to file

daniel 014e5c7f23

Build and Push Multi-Arch Docker Images / Build control-panel-backend (amd64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Build control-panel-frontend (amd64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Build resource-cluster (amd64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Build tenant-app (amd64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Build tenant-backend (amd64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Build control-panel-backend (arm64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Build control-panel-frontend (arm64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Build resource-cluster (arm64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Build tenant-app (arm64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Build tenant-backend (arm64) (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Create multi-arch manifest for control-panel-backend (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Create multi-arch manifest for control-panel-frontend (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Create multi-arch manifest for resource-cluster (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Create multi-arch manifest for tenant-app (push) Has been cancelled

Build and Push Multi-Arch Docker Images / Create multi-arch manifest for tenant-backend (push) Has been cancelled

Update README.md

2026-01-10 04:11:15 +00:00

.deployment/docker

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

.github

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

apps

fix: remove SSRF protection for Community Edition local deployments

2025-12-15 16:18:24 -05:00

demo-agents

fix: remove nvidia-ctk runtime configure from DGX installer (conflicts with DGX OS override.conf)

2025-12-12 18:42:53 -05:00

installers

delete mac

2026-01-10 04:00:18 +00:00

packages

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

scripts

fix: correct max_tokens for Groq models

2025-12-16 10:06:29 -05:00

.env.template

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

.gitignore

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

CODE_OF_CONDUCT.md

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

CONTRIBUTING.md

Update CONTRIBUTING.md

2025-12-21 14:56:15 -05:00

docker-compose.arm64.yml

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

docker-compose.dev.yml

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

docker-compose.dgx.yml

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

docker-compose.x86-gpu.yml

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

docker-compose.x86.yml

fix: reduce RAM requirements for Mac/Ubuntu installers to 8GB

2025-12-29 13:16:10 -05:00

docker-compose.yml

fix: sync tenant DB password from env during init

2025-12-12 19:52:32 -05:00

LICENSE

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

package-lock.json

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

package.json

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

README.md

Update README.md

2026-01-10 04:11:15 +00:00

SECURITY.md

GT AI OS Community Edition v2.0.33

2025-12-12 17:04:45 -05:00

README.md

GT AI OS Community Edition

GT AI OS software is intended to provide easy to use "daily driver" web based generative AI for processing documents & files with data privacy for individuals and organizations. You can install GT AI OS on Ubuntu x86 and NVIDIA DGX OS 7 ARM hosts using Docker.

Start Installation

Minimum 4 CPU cores, 16GB RAM and 50GB SSD storage required for the application. GT AI OS will usually use about 7GB RAM when fully installed.

Local models, conversation history and datasets will consume additional SSD or disk storage.

The provided runbooks are intended to provide a smooth installation and include commands for dependencies. Open an issue on the repo if you have problems with the runbooks.

Build and deploy custom generative AI agents and bring-your-own local or external API inference via NVIDIA NIM, Ollama, Groq, vLLM, SGLang and more.

GT AI OS is ideal for working with documents and files that need data privacy. It is not multimodal and can't generate or process images, videos or audio as of version 2.0.33.

Ensure that you are using local or external inference with zero data retention features if you want your data to remain private.

GT AI OS Wiki

Supported Platforms

Platform	Host Architecture
Ubuntu Linux 24.04	x86_64
NVIDIA DGX OS 7 (Optimized for Grace Blackwell Architecture)	ARM64

Ubuntu VM's running on Proxmox with raw all functions GPU passthrough works. Windows is currently not supported.

Note that the install scripts are unique for each OS and hardware architecture. Carefully choose the correct installation script for your host.

Embedding model GPU acceleration:

NVIDIA GPU's will significantly accelerate embedding acceleration (uploading files and documents for Retrieval Augemented Generation "RAG"). As of release 2.0.34 the minimum GPU VRAM needed at installation time is 4GB as the embedding model installed is teh BAAI/bge-m3 which consumes around 3.78GB once fully loaded onto the GPU. We will be adjusting the installation scripts in future release so that smaller GPU's down to 1GB can be used on mini desktop computers.

Ensure that your NVIDIA GPU hardware is physically installed prior to starting the GT AI OS installation. Note that all NVIDIA drivers and dependencies will be installed during the standard Ubuntu runbook.

At v2.0.34, once you install GT AI OS, you cannot install GPU hardware and switch from CPU to GPU for embeddings. We are looking to fix this in a future release.

If you do not have an NVIDIA GPU installed in your host, then the CPU and host RAM will be used for running the embedding model. CPU vs GPU accelerated embedding will result in slower file uploads when adding files to datasets.

Features

AI Agent Builder - Create custom AI agents with your own system prompts, categorization, role base access and guardrails
Local Model Support - Run local AI models with Ollama (completely offline)
Document Processing - Upload documents into datasets and create agents to interact with them
Create Teams - For setting up a workgroup that has Team based access to agents and dataasets
Observability - See metrics dashboards including agents, models and dataset usage, chat logs and more

Documentation

Topic	Description
Installation	Detailed setup instructions
Updating	Keep GT AI OS up to date
NVIDIA NIM Setup	Enterprise GPU-accelerated inference
Ollama Setup	Set up local AI models
Groq Cloud Setup	Ultra-fast cloud inference
Cloudflare Tunnel	Access GT AI OS from anywhere
Troubleshooting	Common issues and solutions

Community vs Enterprise

Feature	Community (Free)	Enterprise (Paid)
Users	Up to 10 users	User licenses per seat
Support	GitHub Issues	Dedicated human support
Billing & Reports	Not included	Full financial tracking
Pro Agents	Not included	Pre-built professional agents
AI Inference	BYO/DIY	Fully Managed
Setup	DIY	Fully Managed
Uptime Guarantee	Self	99.99% uptime SLA

Want Enterprise? Contact GT Edge AI

Architecture

┌────────────────────────────────────────────────────────────────┐
│                          GT AI OS                              │
├──────────────────┬──────────────────────┬──────────────────────┤
│   Control Panel  │      Tenant App      │   Resource Cluster   │
│    (Admin UI)    │       (User UI)      │(AI Inference Routing)│
├──────────────────┴──────────────────────┴──────────────────────┤
│                          Postgres DB                            │
│                  Control DB  │  Tenant DB                      │
└────────────────────────────────────────────────────────────────┘

Bug and issue reporting:

Found a bug? Have an idea? Open an issue: https://github.com/GT-Edge-AI-Internal/gt-ai-os-community/issues

Security

Found a security issue? Report via our contact form

See SECURITY.md for our security policy.

License

Apache License 2.0 - See LICENSE

GT AI OS Community Edition | Made by GT Edge AI

Description

The easy to use out of the box AI platform with RBAC, Agentic RAG-powered document processing, and LLM Integrated custom Agent creation.

agents ai chatbot docker embeddings enterprise fastapi groq llm nextjs nvidia ollama pgvector rag

Readme Apache-2.0 2.9 MiB

Languages

Python 54.5%

TypeScript 41%

PLpgSQL 2.3%

Shell 1.8%

JavaScript 0.2%

Other 0.1%