Add checkpoint conversion support for Qwen3-4b-base and Qwen3-8b-base. #3557

Open
niting wants to merge 1 commit into AI-Hypercomputer:main from niting:qwen3-hf-support

niting commented Apr 2, 2026

Description

Add support for conversion of HF Qwen3-4B-Base and Qwen3-8B-Base checkpoints to
MaxText formats.

Tests

Ran the conversion script successfully.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

- [X] I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
- [X] I have necessary comments in my code, particularly in hard-to-understand areas.
- [X] I have run end-to-end tests and provided workload links above if applicable.
- [X] I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.
