
How to Implement Streaming API Endpoint with Next.js 13 Route Handlers Using LangChain?

Question

I am trying to create an API endpoint using Next.js 13's new Route Handler solution.
This API uses LangChain and streams the response back to the frontend.
When calling the OpenAI wrapper class, I pass in the streaming property and supply a callback function. The callback then provides the stream in chunks (i.e. tokens).
I want to stream these tokens to the frontend so the AI's response is displayed as it is being generated.

I was able to get this working using the "old" API route solution with the following code:

// pages/api route ("old" Pages Router solution)
import { OpenAI } from "langchain/llms/openai";

export default async function handler(req, res) {
  const chat = new OpenAI({
    modelName: "gpt-3.5-turbo",
    streaming: true,
    callbacks: [
      {
        // Write each token to the response as it is generated
        handleLLMNewToken(token) {
          res.write(token);
        },
      },
    ],
  });

  await chat.call("Write me a song about sparkling water.");

  res.end();
}
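(For reference, a minimal sketch of how a frontend might consume such a streamed response; the /api/chat path and the #output element are illustrative assumptions, not part of the original setup:)

// Client-side sketch (hypothetical /api/chat path) that reads the
// streamed tokens and appends them to the page as they arrive
async function readStream() {
  const response = await fetch("/api/chat");
  const reader = response.body.getReader();
  const decoder = new TextDecoder();

  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    // Append each decoded chunk to the output element as it arrives
    document.getElementById("output").textContent += decoder.decode(value);
  }
}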

I am trying to convert this code to the new Route Handler solution, but I haven't been able to get this working.

I have tried many different approaches to this, with no luck.

For example:

import { NextResponse } from "next/server";
import { OpenAI } from "langchain/llms/openai";

export const dynamic = "force-dynamic";
export const revalidate = true;

export async function GET(req, res) {
  const chat = new OpenAI({
    modelName: "gpt-3.5-turbo",
    streaming: true,
    callbacks: [
      {
        handleLLMNewToken(token) {
          // res.write(token);
          return new NextResponse.json(token);
        },
      },
    ],
  });

  await chat.call("Write me a song about sparkling water.");
}

There just seems to be no way to "write" tokens to the Route Handler's response as they are streamed: the handler returns a single Response object, so returning a value from handleLLMNewToken has no effect.

Any assistance will be GREATLY appreciated.


Answer 1

Score: 4


I think I might have a solution.

In the Route Handler, I create a new stream object using the TransformStream class, and write the tokens to it as they are generated.
Because the stream expects bytes, I use a TextEncoder to encode each token as a Uint8Array.

Lastly, I return the readable property of the stream as the API response.
This seems to do the trick, although it is slightly more complex than the solution for the older API route approach.

import { OpenAI } from "langchain/llms/openai";

export const dynamic = "force-dynamic";
export const revalidate = 0; // segment config accepts false | 0 | number, not true

async function runLLMChain() {
  // Create an encoder to convert each token (string) to a Uint8Array
  const encoder = new TextEncoder();

  // Create a TransformStream; tokens are written to its writable side
  // and the readable side is returned as the response body
  const stream = new TransformStream();
  const writer = stream.writable.getWriter();

  const chat = new OpenAI({
    modelName: "gpt-3.5-turbo",
    streaming: true,
    callbacks: [
      {
        async handleLLMNewToken(token) {
          await writer.ready;
          await writer.write(encoder.encode(token));
        },
        async handleLLMEnd() {
          await writer.ready;
          await writer.close();
        },
      },
    ],
  });

  // Deliberately not awaited: the model call runs in the background
  // while the readable side of the stream is returned immediately
  chat.call("Write me a song about sparkling water.");

  // Return the readable side of the stream
  return stream.readable;
}

export async function GET(req) {
  const stream = await runLLMChain();
  return new Response(stream);
}
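As a side note, the same behavior can likely be achieved with the ReadableStream constructor instead of a TransformStream, which avoids the explicit writer bookkeeping. This is a minimal sketch under the same assumptions as the code above (same LangChain version and model), not a drop-in replacement from the original answer:

import { OpenAI } from "langchain/llms/openai";

export const dynamic = "force-dynamic";

export async function GET(req) {
  const encoder = new TextEncoder();

  // Enqueue tokens straight into the stream from the LangChain callbacks
  const readable = new ReadableStream({
    start(controller) {
      const chat = new OpenAI({
        modelName: "gpt-3.5-turbo",
        streaming: true,
        callbacks: [
          {
            handleLLMNewToken(token) {
              controller.enqueue(encoder.encode(token));
            },
            handleLLMEnd() {
              controller.close();
            },
          },
        ],
      });
      // Fire-and-forget, as above; the callbacks feed the stream
      chat.call("Write me a song about sparkling water.");
    },
  });

  return new Response(readable);
}

Either way, the key design point is the same: the model call is started without awaiting it, so the handler can return the stream immediately while the callbacks feed it as tokens arrive.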
