…2065) Like with mt_bench, the new logic supports vllm and llama-cpp and passes the base url of the started instance to the mmlu library. This enables: - Consistency with how we are serving from a support perspective - Support for larger models with sharding across gpus - Multi-gpu support - Potential for faster performance with higher serving throughput Related: instructlab/eval#50 instructlab/eval#68 Corresponding Eval change: instructlab/eval#99 Resolves: #1792 **Checklist:** - [ ] **Commit Message Formatting**: Commit titles and messages follow guidelines in the [conventional commits](https://www.conventionalcommits.org/en/v1.0.0/#summary). - [ ] [Changelog](https://github.com/instructlab/instructlab/blob/main/CHANGELOG.md) updated with breaking and/or notable changes for the next minor release. - [ ] Documentation has been updated, if necessary. - [x] Unit tests have been added, if necessary. - [ ] Integration tests have been added, if necessary. Approved-by: nathan-weinberg Approved-by: alimaredia Approved-by: leseb

mmlu isn't consuming multiple gpus #68

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions