Llamafile
Llamafile 可以让你使用单个文件分发和运行 LLM。
Llamafile 通过将 llama.cpp 与 Cosmopolitan Libc 结合到一个框架中,将所有 LLM 的复杂性折叠为一个单文件可执行文件(称为 "llamafile"),在大多数计算机上本地运行,无需安装。
设置
- 下载你想使用的模型的 llamafile。你可以在 HuggingFace 上找到许多 llamafile 格式的模型。在本指南中,我们将下载一个小模型,
TinyLlama-1.1B-Chat-v1.0.Q5_K_M
。注意:如果你没有wget
,你可以通过这个 链接 下载模型。
wget https://huggingface.co/jartine/TinyLlama-1.1B-Chat-v1.0-GGUF/resolve/main/TinyLlama-1.1B-Chat-v1.0.Q5_K_M.llamafile
- 使 llamafile 可执行。首先,如果你还没有这样做,请打开一个终端。如果你使用 MacOS、Linux 或 BSD,你需要使用
chmod
授予计算机执行这个新文件的权限(见下文)。如果你使用 Windows,将文件重命名为在末尾添加 ".exe" 的形式(模型文件应命名为TinyLlama-1.1B-Chat-v1.0.Q5_K_M.llamafile.exe
)。
chmod +x TinyLlama-1.1B-Chat-v1.0.Q5_K_M.llamafile # 如果你使用 MacOS、Linux 或 BSD,请运行此命令
- 以 "服务器模式" 运行 llamafile:
./TinyLlama-1.1B-Chat-v1.0.Q5_K_M.llamafile --server --nobrowser
现在你可以调用 llamafile 的 REST API。默认情况下,llamafile 服务器监听在 http://localhost:8080。你可以在这里找到完整的服务器文档 链接。你可以直接通过 REST API 与 llamafile 交互,但这里我们将展示如何使用 LangChain 与其交互。
使用
from langchain_community.llms.llamafile import Llamafile
llm = Llamafile()
llm.invoke("Tell me a joke")
'? \nI\'ve got a thing for pink, but you know that.\n"Can we not talk about work anymore?" - What did she say?\nI don\'t want to be a burden on you.\nIt\'s hard to keep a good thing going.\nYou can\'t tell me what I want, I have a life too!'
要流式传输标记,请使用 .stream(...)
方法:
query = "Tell me a joke"
for chunks in llm.stream(query):
print(chunks, end="")
print()
.
- She said, "I’m tired of my life. What should I do?"
- The man replied, "I hear you. But don’t worry. Life is just like a joke. It has its funny parts too."
- The woman looked at him, amazed and happy to hear his wise words. - "Thank you for your wisdom," she said, smiling. - He replied, "Any time. But it doesn't come easy. You have to laugh and keep moving forward in life."
- She nodded, thanking him again. - The man smiled wryly. "Life can be tough. Sometimes it seems like you’re never going to get out of your situation."
- He said, "I know that. But the key is not giving up. Life has many ups and downs, but in the end, it will turn out okay."
- The woman's eyes softened. "Thank you for your advice. It's so important to keep moving forward in life," she said. - He nodded once again. "You’re welcome. I hope your journey is filled with laughter and joy."
- They both smiled and left the bar, ready to embark on their respective adventures.
要了解更多关于 LangChain Expressive Language 和 LLM 上可用方法的信息,请参阅 LCEL Interface。