GitHub - wgryc/phasellm: Large language model evaluation and workflow framework from Phase AI.

github.com

GitHub - wgryc/phasellm: Large language model evaluation and workflow framework from Phase AI.

github.com

manitcor@lemmy.intai.techM to Machine Learning - Learning/Language Models@lemmy.intai.techEnglish · 1 year ago

Large language model evaluation and workflow framework from Phase AI. - GitHub - wgryc/phasellm: Large language model evaluation and workflow framework from Phase AI.

Docs: https://phasellm.com/docs/phasellm/eval.html

This project provides a unified framework to test generative language models on a large number of different evaluation tasks.

Features:

200+ tasks implemented. See the task-table for a complete list.
Support for models loaded via transformers (including quantization via AutoGPTQ), - GPT-NeoX, and Megatron-DeepSpeed, with a flexible tokenization-agnostic interface.
Support for commercial APIs including OpenAI, goose.ai, and TextSynth.
Support for evaluation on adapters (e.g. LoRa) supported in HuggingFace’s PEFT library.
Evaluating with publicly available prompts ensures reproducibility and comparability between papers.
Task versioning to ensure reproducibility when tasks are updated.

You must log in or # to comment.

Chat

Machine Learning - Learning/Language Models@lemmy.intai.tech

models@lemmy.intai.tech

Create a post

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

Discussion of models, thier use, setup and options.

Please include models used with your outputs, workflows optional.

Model Catalog

We follow Lemmy’s code of conduct.

Communities

Useful links

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
1 user / week
1 user / month
1 user / 6 months
2 local subscribers
1 subscriber
50 Posts
1 Comment
Modlog

mods:
manitcor@lemmy.intai.tech