<KD, Reasoning> [NAT] Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data (2023.12)

관심있는 NLP 논문을 읽어보고 ChatGPT를 이용하여 정리했습니다. 혹시 부족하거나 잘못된 내용이 있다면 댓글 부탁드립니다 ️ usechatgpt init success Abstract 대규모 언어 모델(Large Language Models, LLMs)은 다양한 추론 작업에서 뛰어난 성능을 보이지만, 많은 매개변수와 접근성 문제로 실제 적용에 어려움이 있음. LLM에서 생성된 사고 과정 체인을 활용하여 작은 모델로 추론 능력을 정제하는 방법이 유망함. 복잡한 수학 문제 등에서 LLM이 잘못된 추론 체인을 생성할 수 있음. 기존 연구는 정확한 답변을 포함한 긍정적 데이터만을 이용하고, 잘못된 답변을 포함한 데이터는 제외함. 본 연구에서는 부정적 데이터의 중요성을 제시하고, 긍정적 및 부정적 샘플..

원문링크 : <KD, Reasoning> [NAT] Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data (2023.12)

<KD, Reasoning> [NAT] Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data (2023.12)

등록된 다른 글

Building AI Projects(1) - Workflow of a machine learning project

[BOJ] 10799 : 쇠막대기 [자료구조](Python)

[프로그래머스] 짝지어 제거하기 (Python)

<LLM> [MoE] Mixtral of Experts (2024.01)

Fitting as minimisation problem

<CoT, Agent> ReAct: Synergizing Reasoning and Acting in Language Models (2022.10 → 2023.03)

[프로그래머스] 명예의 전당 (1) (Python)

<SLM> Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone (2024.04)

키자드 로그인

키자드

네이버 블로그

티스토리

커뮤니티