Llamafile

Llamafile은 단일 파일로 LLM을 배포하고 실행할 수 있게 해줍니다. Llamafile은 llama.cpp와 Cosmopolitan Libc를 하나의 프레임워크로 결합하여 LLM의 모든 복잡성을 단일 파일 실행 파일(“llamafile”이라고 함)로 압축합니다. 이 파일은 별도의 설치 없이 대부분의 컴퓨터에서 로컬로 실행됩니다.

Setup

사용하려는 모델의 llamafile을 다운로드합니다. HuggingFace에서 llamafile 형식의 많은 모델을 찾을 수 있습니다. 이 가이드에서는 작은 모델인 TinyLlama-1.1B-Chat-v1.0.Q5_K_M을 다운로드합니다. 참고: wget이 없는 경우 이 링크를 통해 모델을 다운로드할 수 있습니다.

wget https://huggingface.co/jartine/TinyLlama-1.1B-Chat-v1.0-GGUF/resolve/main/TinyLlama-1.1B-Chat-v1.0.Q5_K_M.llamafile

llamafile을 실행 가능하게 만듭니다. 먼저, 아직 하지 않았다면 터미널을 엽니다. MacOS, Linux 또는 BSD를 사용하는 경우, chmod를 사용하여 컴퓨터가 이 새 파일을 실행할 수 있도록 권한을 부여해야 합니다(아래 참조). Windows를 사용하는 경우, 파일 이름 끝에 “.exe”를 추가하여 파일 이름을 변경합니다(모델 파일 이름은 TinyLlama-1.1B-Chat-v1.0.Q5_K_M.llamafile.exe가 되어야 합니다).

chmod +x TinyLlama-1.1B-Chat-v1.0.Q5_K_M.llamafile  # run if you're on MacOS, Linux, or BSD

llamafile을 “server mode”로 실행합니다:

./TinyLlama-1.1B-Chat-v1.0.Q5_K_M.llamafile --server --nobrowser

이제 llamafile의 REST API를 호출할 수 있습니다. 기본적으로 llamafile server는 localhost:8080에서 수신 대기합니다. 전체 server 문서는 여기에서 찾을 수 있습니다. REST API를 통해 llamafile과 직접 상호작용할 수 있지만, 여기서는 LangChain을 사용하여 상호작용하는 방법을 보여드리겠습니다.

Usage

from langchain_community.llms.llamafile import Llamafile

llm = Llamafile()

llm.invoke("Tell me a joke")

'? \nI\'ve got a thing for pink, but you know that.\n"Can we not talk about work anymore?" - What did she say?\nI don\'t want to be a burden on you.\nIt\'s hard to keep a good thing going.\nYou can\'t tell me what I want, I have a life too!'

token을 streaming하려면 .stream(...) method를 사용합니다:

query = "Tell me a joke"

for chunks in llm.stream(query):
    print(chunks, end="")

print()

.
- She said, "I’m tired of my life. What should I do?"
- The man replied, "I hear you. But don’t worry. Life is just like a joke. It has its funny parts too."
- The woman looked at him, amazed and happy to hear his wise words. - "Thank you for your wisdom," she said, smiling. - He replied, "Any time. But it doesn't come easy. You have to laugh and keep moving forward in life."
- She nodded, thanking him again. - The man smiled wryly. "Life can be tough. Sometimes it seems like you’re never going to get out of your situation."
- He said, "I know that. But the key is not giving up. Life has many ups and downs, but in the end, it will turn out okay."
- The woman's eyes softened. "Thank you for your advice. It's so important to keep moving forward in life," she said. - He nodded once again. "You’re welcome. I hope your journey is filled with laughter and joy."
- They both smiled and left the bar, ready to embark on their respective adventures.

Edit the source of this page on GitHub.

Connect these docs programmatically to Claude, VSCode, and more via MCP for real-time answers.

Popular Providers

Integrations by component

Setup

Usage

Popular Providers

Integrations by component

​Setup

​Usage

Setup

Usage