Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for Granite 3.1 model family (IBM) #3261

Merged
merged 11 commits into from
Jan 13, 2025

Conversation

RonalddMatias
Copy link
Contributor

Description

This Pull Request adds support for both the base and instruct versions of the Granite 3.1 model family developed by IBM, which are released as open-source on the Hugging Face platform. These additions expand the options available for various NLP tasks and text generation scenarios.

Main Changes

Added both base and instruct models from the Granite 3.1 family (open-source) to the list of supported models.
Adjusted configuration files to accommodate the specific parameters for these new models.

Benchmarks Executed

The Granite 3.1 models, including both the base and instruct versions, were tested on various benchmarks, including MMLU (Massive Multitask Language Understanding) and HumanEval for code generation. These models demonstrated competitive performance compared to other models of similar scale.

@yifanmai
Copy link
Collaborator

This looks great, thank you!

@yifanmai yifanmai merged commit b9ad574 into stanford-crfm:main Jan 13, 2025
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants