1 file changed
+1
-1
lines changedSubmodule alpaca_eval updated 14 files
- docs/data_AlpacaEval/alpaca_eval_gpt4_leaderboard.csv+1
- docs/data_AlpacaEval_2/weighted_alpaca_eval_gpt4_turbo_leaderboard.csv+5-3
- notebooks/length_correction.ipynb+4.6k-3.9k
- results/Mistral-7B-ReMax-v0.1/alpaca_eval_gpt4_turbo_fn/annotations.json+11.3k
- results/Mistral-7B-ReMax-v0.1/model_outputs.json+5.6k
- results/Mistral-7B-ReMax-v0.1/weighted_alpaca_eval_gpt4_turbo/annotations.json+64.9k
- results/gpt4_gamed/model_outputs.json+4.8k
- results/gpt4_gamed/weighted_alpaca_eval_gpt4_turbo/annotations.json+63.4k
- src/alpaca_eval/leaderboards/data_AlpacaEval/alpaca_eval_gpt4_leaderboard.csv+1
- src/alpaca_eval/leaderboards/data_AlpacaEval_2/weighted_alpaca_eval_gpt4_turbo_leaderboard.csv+4-2
- src/alpaca_eval/models_configs/Mistral-7B-ReMax-v0.1/configs.yaml+13
- src/alpaca_eval/models_configs/Mistral-7B-ReMax-v0.1/prompt.txt+1
- src/alpaca_eval/models_configs/Qwen1.5-72B-Chat/configs.yaml+2-1
- src/alpaca_eval/models_configs/gpt4_gamed/configs.yaml+8
0 commit comments