TechStudy/LLM
2024. 5. 13.
Llama3 and ORPO (for reading)
The odds term in ORPO is computed from the likelihood P(y|x) of the model generating an output sequence y given an input sequence x: odds(y|x) = P(y|x) / (1 - P(y|x)). A value of n indicates that the model is n times more likely to generate the sequence y than not to generate it. The odds ratio of chosen responses over rejected responses, odds(y_chosen|x) / odds(y_rejected|x), measures how much more likely the model is to generate the chosen response than the rejected one. The log of this odds ratio is considered becau..
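
The quantities described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `odds` and `log_odds_ratio` and the example probabilities are hypothetical, and each probability stands in for the model's likelihood of a whole response given the prompt (a real implementation would typically work with log-probabilities from the model for numerical stability).

```python
import math


def odds(p: float) -> float:
    """Odds of an event with probability p: p / (1 - p)."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must be strictly between 0 and 1")
    return p / (1.0 - p)


def log_odds_ratio(p_chosen: float, p_rejected: float) -> float:
    """Log of odds(chosen) / odds(rejected).

    Positive when the chosen response is favored over the rejected one,
    negative when the rejected response is favored.
    """
    return math.log(odds(p_chosen)) - math.log(odds(p_rejected))


# Hypothetical sequence likelihoods, chosen only for illustration.
p_chosen, p_rejected = 0.75, 0.5

# odds(0.75) is 3: the model is 3 times more likely to generate
# the chosen response than not to generate it.
print(odds(p_chosen))
print(log_odds_ratio(p_chosen, p_rejected))
```

Swapping the two arguments flips the sign of the log odds ratio, which is what makes it usable as a signal for which response the model currently prefers.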