Good news, a pity that they compared with GPT-3.5 but it will probably also be true for the next generation of models.
"Our analysis shows that fine-tuning improves the performance of open-source LLMs, allowing them to match or even surpass zero-shot GPT 3.5 and GPT-4, though still lagging behind fine-tuned GPT
3.5. "
https://link.springer.com/article/10.1007/s42001-024-00345-9
#opensource #LLM #AI #finetuning