<LLM> Lost in the Middle: How Language Models Use Long Contexts

최근(2023.07)에 나온 논문을 읽어보고 간단히 정리했습니다. 혹시 부족하거나 잘못된 내용이 있다면 댓글 부탁드립니다 ️ usechatgpt init success [Standford University] multi-document QA와 key-value retireval에서, query와 관련된 정보가 context의 시작, 또는 끝에 위치하는 것이 유리하다. 이 경향은 context가 길어질수록 명확해지기 때문에, context의 길이를 x축으로 삼고 모델 성능을 y축으로 삼는 그래프는 U자 curve로 그려진다. 배경 최근 LLM을 언급하면 빠질 수 없는 이야기는 처리 가능한 입력 길이입니다. 이를 늘리기 위해서 다양한 연구가 이뤄지고 있는데, 실제로 참조해야 할 문서가 많아질수록 모델의..

원문링크 : <LLM> Lost in the Middle: How Language Models Use Long Contexts

<LLM> Lost in the Middle: How Language Models Use Long Contexts

등록된 다른 글

Non-technical explanation of deep learning(Part 1,2 optional)

<Instruction> Self-Alignment with Instruction Backtranslation

Union-Find(1) : Dynamic Connectivity

<KD, Hallucination> [Idk Dataset] Can AI Assistants Know What They Don't Know? (2024.01)

[프로그래머스] 덧칠하기 (Python)

<PEFT> ResLoRA: Identity Residual Mapping in Low-Rank Adaption (2024.02)

GSAT 온라인 모의고사

GPT-4의 토큰별 예측 확률을 확인할 수 있을까? (부분적으로 가능하다!)

키자드 로그인

키자드

네이버 블로그

티스토리

커뮤니티