Accelerator Agents

Accelerator Agents is a collection of AI-powered tools designed to accelerate machine learning development on Google Cloud TPUs. This repository hosts agents that assist with code migration, kernel optimization, and performance tuning, enabling developers to leverage the full power of TPUs with greater velocity.

Disclaimer: This is not an officially supported Google product. This project is not eligible for the Google Open Source Software Vulnerability Rewards Program.

Overview

As machine learning models grow in complexity, optimizing them for specific hardware accelerators like TPUs becomes increasingly challenging. This project aims to provide a suite of "Agents"—specialized AI tools powered by Gemini—to automate and assist with these complex tasks.

The project includes two primary agents:

1. MaxCode

The MaxCode agent facilitates the conversion of existing PyTorch models and codebases into JAX. It is designed to help users migrate their workloads to run efficiently on TPUs, leveraging high-performance frameworks like MaxText.

Note: MaxCode is under active development, and we are continuously working to improve migration quality and expand model coverage.

Features:

Automated Conversion: Converts functional code blocks and model layers from PyTorch to JAX.
MaxText Integration: Generates JAX code compatible with the MaxText framework for immediate training and inference on TPUs.
Human-in-the-Loop: Designed to draft initial implementations that developers can review and refine.

2. MaxKernel

The MaxKernel agent is a specialized tool for high-performance kernel development on TPUs. It assists engineers in writing, optimizing, and debugging custom kernels, specifically focusing on Pallas (JAX's kernel language).

Features:

Kernel Writing: Drafts Pallas kernels from scratch or based on JAX reference implementations.
CUDA to Pallas Conversion: Assists in porting custom CUDA/GPU kernels to run optimally on TPUs.
Optimization & Profiling: Provides profiling insights and optimization suggestions to improve kernel performance (MFU).
Test Harness Generation: Automatically generates boilerplate code for correctness testing and compilation checks.

Getting Started

Prerequisites

A Google Cloud VM. A CPU-only VM is sufficient for MaxCode, while a TPU VM is recommended for MaxKernel.
Python 3.11+
Access to Gemini API (for agent reasoning capabilities).

Installation

Clone the repository: bash git clone https://github.com/AI-Hypercomputer/accelerator-agents.git cd accelerator-agents

(Note: Specific installation instructions for each agent can be found in their respective subdirectories.)

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details on how to submit pull requests, report issues, and contribute to the project.

License

This project is licensed under the Apache License, Version 2.0. See LICENSE for the full license text.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
MaxCode		MaxCode
MaxKernel		MaxKernel
docs		docs
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Accelerator Agents

Overview

1. MaxCode

2. MaxKernel

Getting Started

Prerequisites

Installation

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Accelerator Agents

Overview

1. MaxCode

2. MaxKernel

Getting Started

Prerequisites

Installation

Contributing

License

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages