<Instruction> WizardLM: Empowering Large Language Models to Follow Complex Instructions

최근(2023.04)에 나온 논문을 읽어보고 간단히 정리했습니다. 혹시 부족하거나 잘못된 내용이 있다면 댓글 부탁드립니다 ️ usechatgpt init success [Microsoft] 대량의 instruction data를 생성하는 방법론 Evol-Instruct을 제시. 이를 이용해 생성한 데이터셋으로 fine-tuning한 모델 WizardLM이 Alpaca, Vicuna를 압도. 배경 LLM이 instruction data를 활용하는 경우, 그 성능이 눈에 띄게 좋아진다는 것은 잘 알려져 있습니다. 우리에게 익숙한 ChatGPT도 이를 적극적으로 잘 활용하여 학습된 모델이죠. 예전에는 instruction data라고 해봤자, 특정 도메인에 한정되고(closed-domain) 아주 간단한..

원문링크 : <Instruction> WizardLM: Empowering Large Language Models to Follow Complex Instructions

<Instruction> WizardLM: Empowering Large Language Models to Follow Complex Instructions

등록된 다른 글

<Prompting, Decomposition> Least-to-Most Prompting Enables Complex Reasoning in Large Language Models (2023.04)

Matrix Inverses

[프로그래머스] 숫자 짝꿍(Python)

Taylor series for approximations(1)

Shallow Neural Network(3)

Face Recognition(3) : Siamese Network

<LLM> [Google DeepMind] Gemma: Open Models Based on GeminiResearch and Technology (2024.02)

[BOJ] 1149 : RGB거리 [다이나믹 프로그래밍](Python)

키자드 로그인

키자드

네이버 블로그

티스토리

커뮤니티