【Model】【llm38】Llama API - Example

Study notes on quality articles

05 Apr 2026 — 6 min read

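Goal

Llama API is a hosted API service for Llama 2 that supports function calling. This example shows how to integrate Llama API through LlamaIndex to perform basic text completion, chat, function calling, and structured data extraction. Because the model is served through an API, developers can use Llama 2 without deploying it locally. Function calling also lets the model interact with external tools and services, which broadens the range of applications.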

Environment Setup

1. Install dependencies

Install the required packages:

    %pip install llama-index-program-openai
    %pip install llama-index-llms-llama-api
    !pip install llama-index

2. Get an API key

To run this example, you need an API key from the official Llama API website.

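3. Import the library and set the API key

Import the required library and set the API key:

    from llama_index.llms.llama_api import LlamaAPI

    # Replace the placeholder with your own key
    api_key = "LL-your-key"
    llm = LlamaAPI(api_key=api_key)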
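Hard-coding the key works in a notebook, but a common alternative is to read it from an environment variable so it never lands in source control. The following is a minimal sketch; the variable name `LLAMA_API_KEY` and the helper `load_api_key` are assumptions made for illustration and are not part of Llama API or LlamaIndex.

```python
import os


def load_api_key(env=None, var_name="LLAMA_API_KEY"):
    """Return the API key stored in an environment mapping.

    `var_name` is a hypothetical variable name chosen for this sketch.
    """
    env = os.environ if env is None else env
    key = env.get(var_name, "").strip()
    if not key:
        raise RuntimeError(f"Set the {var_name} environment variable first")
    return key


# Usage (assumes the variable was exported before starting Python):
# api_key = load_api_key()
# llm = LlamaAPI(api_key=api_key)
```

Passing the mapping as a parameter keeps the helper easy to test without touching the real environment.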

Implementation

1. Basic usage - text completion

Use the complete method for text completion:

    resp = llm.complete("Paul Graham is ")
    print(resp)

Example output:

Paul Graham is a well-known computer scientist and entrepreneur, best known for his work as a co-founder of Viaweb and later Y Combinator, a successful startup accelerator. He is also a prominent essayist and has written extensively on topics such as entrepreneurship, software development, and the tech industry.

2. Basic usage - chat

Use the chat method for a conversation:

    from llama_index.core.llms import ChatMessage

    # The system message sets the persona; the user message asks the question
    messages = [
        ChatMessage(
            role="system", content="You are a pirate with a colorful personality"
        ),
        ChatMessage(role="user", content="What is your name"),
    ]
    resp = llm.chat(messages)
    print(resp)

Example output:

assistant: Arrrr, me hearty! Me name be Captain Blackbeak, the scurviest dog on the seven seas! Yer lookin' fer a swashbucklin' adventure, eh? Well, hoist the sails and set course fer the high seas, matey! I be here to help ye find yer treasure and battle any scurvy dogs who dare cross our path! So, what be yer first question, landlubber?

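3. Function calling

To use function calling, first define a Song model:

    from pydantic import BaseModel
    from llama_index.core.llms.openai_utils import to_openai_function


    class Song(BaseModel):
        """A song with name and artist"""

        name: str
        artist: str


    # Convert the Pydantic model into an OpenAI-style function definition
    song_fn = to_openai_function(Song)

Generate song information with a function call

    llm = LlamaAPI(api_key=api_key)
    response = llm.complete("Generate a song", functions=[song_fn])
    # The function call is returned in the response's additional_kwargs
    function_call = response.additional_kwargs["function_call"]
    print(function_call)

Example output: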

{'name': 'Song', 'arguments': {'name': 'Happy', 'artist': 'Pharrell Williams'}}
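The function call printed above is a plain dictionary, so it still has to be checked before the rest of an application relies on it. The sketch below shows one way to do that using only the standard library. `SongRecord` and `parse_song_call` are hypothetical names, the dataclass is a stand-in for the Pydantic Song model, and the JSON-string branch is an assumption about other OpenAI-style backends rather than something this example's output shows.

```python
import json
from dataclasses import dataclass


@dataclass
class SongRecord:
    """Plain stand-in for the Pydantic Song model (hypothetical name)."""

    name: str
    artist: str


def parse_song_call(function_call):
    """Turn a function_call payload into a SongRecord."""
    if function_call.get("name") != "Song":
        raise ValueError(f"unexpected function: {function_call.get('name')!r}")
    args = function_call["arguments"]
    # Assumption: some backends return the arguments as a JSON string
    if isinstance(args, str):
        args = json.loads(args)
    return SongRecord(name=str(args["name"]), artist=str(args["artist"]))


call = {"name": "Song", "arguments": {"name": "Happy", "artist": "Pharrell Williams"}}
song = parse_song_call(call)
print(song)
```

With the Pydantic model from this section, the equivalent step would be constructing `Song` from the arguments dictionary, which also validates the field types.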

4. Structured data extraction

Define the Album and Song models used for structured data extraction:

    from pydantic import BaseModel
    from typing import List


    class Song(BaseModel):
        """Data model for a song."""

        title: str
        length_mins: int


    class Album(BaseModel):
        """Data model for an album."""

        name: str
        artist: str
        songs: List[Song]

Create the Pydantic program

    from llama_index.program.openai import OpenAIPydanticProgram

    # {text} is filled in when the program is called
    prompt_template_str = """\
    Extract album and songs from the text provided.
    For each song, make sure to specify the title and the length_mins.
    {text}
    """

    # temperature=0.0 keeps the extraction as deterministic as possible
    llm = LlamaAPI(api_key=api_key, temperature=0.0)

    program = OpenAIPydanticProgram.from_defaults(
        output_cls=Album,
        llm=llm,
        prompt_template_str=prompt_template_str,
        verbose=True,
    )
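To make the role of the `{text}` placeholder concrete, the sketch below fills the same template by hand. It uses Python's built-in `str.format` purely as an illustration; `render_prompt` is a hypothetical helper, and the library's own template handling may differ in detail.

```python
prompt_template_str = """\
Extract album and songs from the text provided.
For each song, make sure to specify the title and the length_mins.
{text}
"""


def render_prompt(template, **kwargs):
    """Fill the named placeholders of a prompt template."""
    return template.format(**kwargs)


prompt = render_prompt(prompt_template_str, text="An album description goes here.")
print(prompt)
```

The keyword used when calling the program has to match the placeholder name, which is why the call in the next step passes its input as `text`.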

Run the program to extract structured data

    output = program(
        text="""
    "Echoes of Eternity" is a compelling and thought-provoking album, skillfully crafted by the renowned artist, Seraphina Rivers. \
    This captivating musical collection takes listeners on an introspective journey, delving into the depths of the human experience \
    and the vastness of the universe. With her mesmerizing vocals and poignant songwriting, Seraphina Rivers infuses each track with \
    raw emotion and a sense of cosmic wonder. The album features several standout songs, including the hauntingly beautiful "Stardust \
    Serenade," a celestial ballad that lasts for six minutes, carrying listeners through a celestial dreamscape. "Eclipse of the Soul" \
    captivates with its enchanting melodies and spans over eight minutes, inviting introspection and contemplation. Another gem, "Infinity \
    Embrace," unfolds like a cosmic odyssey, lasting nearly ten minutes, drawing listeners deeper into its ethereal atmosphere. "Echoes of Eternity" \
    is a masterful testament to Seraphina Rivers' artistic prowess, leaving an enduring impact on all who embark on this musical voyage through \
    time and space.
    """
    )

Example output:

Function call: Album with args: {'name': 'Echoes of Eternity', 'artist': 'Seraphina Rivers', 'songs': [{'title': 'Stardust Serenade', 'length_mins': 6}, {'title': 'Eclipse of the Soul', 'length_mins': 8}, {'title': 'Infinity Embrace', 'length_mins': 10}]}

View the structured output

    output

Example output:

Album(name='Echoes of Eternity', artist='Seraphina Rivers', songs=[Song(title='Stardust Serenade', length_mins=6), Song(title='Eclipse of the Soul', length_mins=8), Song(title='Infinity Embrace', length_mins=10)])
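Once the result is a typed object rather than free text, ordinary Python can work with it directly. The sketch below sums the track lengths and serializes the album to JSON. `SongRow` and `AlbumRow` are hypothetical dataclass stand-ins that mirror the Pydantic models so the block runs without extra packages; `total_length_mins` only relies on attribute access, so it should work on the Pydantic `Album` as well.

```python
import json
from dataclasses import asdict, dataclass
from typing import List


@dataclass
class SongRow:
    """Stand-in for the Pydantic Song model (hypothetical name)."""

    title: str
    length_mins: int


@dataclass
class AlbumRow:
    """Stand-in for the Pydantic Album model (hypothetical name)."""

    name: str
    artist: str
    songs: List[SongRow]


def total_length_mins(album):
    """Sum the song lengths of any object exposing .songs[i].length_mins."""
    return sum(song.length_mins for song in album.songs)


album = AlbumRow(
    name="Echoes of Eternity",
    artist="Seraphina Rivers",
    songs=[
        SongRow("Stardust Serenade", 6),
        SongRow("Eclipse of the Soul", 8),
        SongRow("Infinity Embrace", 10),
    ],
)

print(total_length_mins(album))  # 6 + 8 + 10 = 24
# asdict converts nested dataclasses recursively, so the result is JSON-ready
album_json = json.dumps(asdict(album), indent=2)
print(album_json)
```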

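Results

This example demonstrates several Llama API capabilities and use cases:

Basic text completion: handles simple completion tasks, such as introducing Paul Graham
Chat: supports multi-turn conversations and generates in-character replies based on the system prompt and user messages
Function calling: generates structured function-call arguments from the input
Structured data extraction: extracts structured information from unstructured text, such as the album name, artist, and song list from an album description
OpenAI compatibility: the function-calling format is compatible with the OpenAI API, so OpenAI-oriented tools and libraries can be used for integration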

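Implementation Approach

The example is built on the following ideas:

API integration: LlamaIndex's LlamaAPI class wraps the Llama API service behind a uniform interface
Basic interaction: the complete and chat methods cover the two basic interaction modes
Function calling: OpenAI-compatible function calling lets the model interact with external tools
Structured data extraction: Pydantic models define the data structure, and OpenAIPydanticProgram extracts the structured information
Model definition: Pydantic data models keep the output structured and type-safe
Prompt engineering: a suitable prompt template guides the model toward output that meets the requirements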

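Suggested Extensions

More function calls: define more complex functions to support richer interactions
Multimodal support: if the API supports it, extend the example to multimodal data
Error handling: add thorough error handling to make the application more stable
Caching: cache responses to avoid repeated requests and improve efficiency
Streaming responses: if the API supports it, implement streaming
Performance monitoring: track the response time and resource usage of API calls
Cost control: monitor API call costs and adjust usage accordingly
Custom tools: build custom tools to extend what the model can do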
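The error handling and caching suggestions above can be sketched together as a small wrapper around any function that maps a prompt to text. This is a minimal illustration under stated assumptions: `with_retry_and_cache` and `flaky_complete` are hypothetical names, the retry policy (three attempts with exponential backoff) is arbitrary, and the fake backend stands in for a real API call so the block runs offline.

```python
import time


def with_retry_and_cache(call, max_attempts=3, base_delay=0.5, sleep=time.sleep):
    """Wrap a prompt -> text callable with an in-memory cache and retries."""
    cache = {}

    def wrapped(prompt):
        if prompt in cache:
            return cache[prompt]
        for attempt in range(1, max_attempts + 1):
            try:
                result = call(prompt)
            except Exception:
                # Out of attempts: let the caller see the original error
                if attempt == max_attempts:
                    raise
                # Exponential backoff: base_delay, 2x, 4x, ...
                sleep(base_delay * 2 ** (attempt - 1))
            else:
                cache[prompt] = result
                return result

    return wrapped


# Demo with a fake backend that fails once, then succeeds.
attempts = {"count": 0}


def flaky_complete(prompt):
    attempts["count"] += 1
    if attempts["count"] == 1:
        raise ConnectionError("simulated network failure")
    return f"echo: {prompt}"


safe_complete = with_retry_and_cache(flaky_complete, sleep=lambda seconds: None)
first = safe_complete("Paul Graham is ")
second = safe_complete("Paul Graham is ")  # served from the cache
print(first, attempts["count"])
```

With the client from this article, the wrapped callable could be something like `lambda p: str(llm.complete(p))`. In real use it would be better to catch only the specific exceptions that indicate a transient failure instead of every `Exception`.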

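Summary

Llama API gives developers a convenient way to use Llama 2 without local deployment by calling the model service directly through an API. With the LlamaIndex integration, a few simple calls cover text completion, chat, function calling, and structured data extraction. Function calling and structured data extraction in particular let the model interact with external tools and services, which broadens the range of applications. Llama API's OpenAI compatibility also lets developers reuse existing OpenAI tools and libraries, which lowers the learning cost. Overall, Llama API is a Llama 2 hosting option worth considering, especially for developers who want to get a Llama 2 application running quickly.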
