Fixing Tool Calling #11
Conversation
@matrushka great! Also trying to get the Ollama provider with function calling to work for the https://github.com/louis030195/screen-pipe desktop app; facing this with llama3.1 atm. Will test your PR, prob happy to help anyhow or jump on a call; can DM on Discord/X at …
Thanks @matrushka, I am traveling today and will not have time to check the PR until tomorrow. Is this using the new Ollama 0.3 API?
Tested it with ollama/ollama:0.3.0 and also, weirdly, 0.2.7 :) The reference data structure I used is the same in both versions (last updated 2 weeks ago): https://github.com/ollama/ollama/blob/v0.3.0/server/testdata/tools/messages.json
@matrushka any idea how I can install this branch as a dependency in my package.json? Tried the suggestions here but they didn't work (https://stackoverflow.com/questions/16350673/depend-on-a-branch-or-tag-using-a-git-url-in-a-package-json), prob because it's a subdirectory (I can't wait until tomorrow for the merge).
Pull the repo to your local machine and build it.
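One way to consume that local build, as a minimal sketch (the package name and relative path below are assumptions, adjust them to your clone layout), is an npm `file:` dependency in package.json:

```json
{
  "dependencies": {
    "ollama-ai-provider": "file:../ollama-ai-provider/packages/ollama-ai-provider"
  }
}
```

npm's `file:` protocol installs from a local folder, which sidesteps the limitation from the Stack Overflow thread above (a git URL in package.json cannot point at a subdirectory of a repo). Run the package's build first so its entry points exist.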
@matrushka thx, still fails:

```ts
import { z } from "zod";
import { generateText } from "ai";
import { ollama } from "ollama-ai-provider"; // local build of this PR branch

const screenpipeQuery = z.object({
  q: z
    .string()
    .describe(
      "The search query matching exact keywords. Use a single keyword that best matches the user intent"
    ),
  contentType: z
    .enum(["ocr", "audio", "all"])
    .describe(
      "The type of content to search for: screenshot data or audio transcriptions"
    ),
  limit: z
    .number()
    .default(5)
    .describe(
      "Number of results to return (default: 5). Don't return more than 50 results as it will be fed to an LLM"
    ),
  offset: z.number().default(0).describe("Offset for pagination (default: 0)"),
  startTime: z
    .string()
    // 1 hour ago
    .default(new Date(Date.now() - 3600000).toISOString())
    .describe("Start time for search range in ISO 8601 format"),
  endTime: z
    .string()
    .default(new Date().toISOString())
    .describe("End time for search range in ISO 8601 format"),
});

const screenpipeMultiQuery = z.object({
  queries: z.array(screenpipeQuery),
});

async function queryScreenpipeNtimes(
  params: z.infer<typeof screenpipeMultiQuery>
) {
  return Promise.all(params.queries.map(queryScreenpipe));
}

// Add this new function to handle screenpipe requests
async function queryScreenpipe(params: z.infer<typeof screenpipeQuery>) {
  try {
    const queryParams = new URLSearchParams({
      q: params.q,
      offset: params.offset.toString(),
      limit: params.limit.toString(),
      start_date: params.startTime,
      end_date: params.endTime,
      content_type: params.contentType,
    });
    console.log("calling screenpipe", JSON.stringify(params));
    const response = await fetch(`http://localhost:3030/search?${queryParams}`);
    if (!response.ok) {
      const text = await response.text();
      throw new Error(`HTTP error! status: ${response.status} ${text}`);
    }
    const result = await response.json();
    console.log("result", result);
    return result;
  } catch (error) {
    console.error("Error querying screenpipe:", error);
    return null;
  }
}

const text = await generateText({
  model: useOllama ? ollama(model) : provider(model),
  tools: {
    query_screenpipe: {
      description:
        "Query the local screenpipe instance for relevant information. You will return multiple queries under the key 'queries'.",
      parameters: screenpipeMultiQuery,
      execute: queryScreenpipeNtimes,
    },
  },
  toolChoice: "required",
  messages: [
    // ...
  ],
});
```
@louis030195 seems like you couldn't configure it properly (or maybe forgot to run the build).
Got it working!! llama3.1 function calling is on. @matrushka, had to do this hack though:

```ts
const { textStream } = useOllama
  ? await streamText({
      model: ollama(model),
      // Hack: serialize the whole conversation (including tool calls and
      // tool results) into a single string prompt instead of passing
      // structured messages.
      prompt: JSON.stringify([
        {
          role: "user",
          content:
            messages.findLast((msg) => msg.role === "user")?.content ||
            inputMessage,
        },
        {
          role: "assistant",
          content: text.toolCalls,
        },
        {
          role: "tool",
          content: text.toolResults,
        },
      ]),
    })
  : await streamText({
      model: provider(model),
      messages: [
        {
          role: "user",
          content:
            messages.findLast((msg) => msg.role === "user")?.content ||
            inputMessage,
        },
        {
          role: "assistant",
          content: text.toolCalls,
        },
        {
          role: "tool",
          content: text.toolResults,
        },
      ],
    });
```

Basically got a Go JSON unmarshal error; suspecting the RAG data was in a weird format in the messages, so I just sent a string.
I had a similar issue as well; it seems that when your tool returns an object instead of a string, Ollama throws that error. It went away when I encoded the tool response as JSON. Might be interesting to implement that behavior so users don't need to care about it.
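A minimal sketch of that behavior (the helper name here is hypothetical, not part of the provider): wrap a tool's `execute` so its result always reaches the model as a JSON string:

```ts
// Hypothetical helper: ensure a tool result is handed back as a string.
// Ollama reportedly fails with a JSON unmarshal error when the tool
// message content is an object rather than a string.
function withJsonResult<P, R>(
  execute: (params: P) => Promise<R>
): (params: P) => Promise<string> {
  return async (params) => {
    const result = await execute(params);
    // Pass strings through unchanged; serialize everything else.
    return typeof result === "string" ? result : JSON.stringify(result);
  };
}

// Usage with the tool from the snippet above:
// execute: withJsonResult(queryScreenpipeNtimes)
```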
Hi @matrushka! I think this is not working with the new API. Tool-call inference only appeared to work because I was injecting the tools into the system prompt and detecting whether the response contained a JSON payload to infer the tool call. You can see the response does not carry the tool info in the tool property; the JSON ends up inside the answer text instead. If I remove the system tool-prompt injection, content is now empty and the Ollama answer contains the tool call in the right property.
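For reference, a sketch of the response shape being described, with field names taken from the Ollama 0.3 tools testdata linked earlier in the thread (illustrative, not authoritative):

```ts
// With native tool calling, the call lands in message.tool_calls instead
// of being embedded as JSON inside the answer text.
interface OllamaToolCall {
  function: {
    name: string;
    arguments: Record<string, unknown>; // an object, not a JSON string
  };
}

interface OllamaChatResponseMessage {
  role: "assistant";
  content: string; // "" when the model chose to call a tool
  tool_calls?: OllamaToolCall[];
}
```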
@matrushka I forgot to say that I have tested with llama3.1. It may be that some models give more priority to the system prompt than to the tool definitions, so it is likely that, depending on the model, it will work one way or the other. In any case, the injection prompt is no longer necessary. Do you mind removing it? Or, if you prefer, I'll accept your PR as is and take care of finishing it.
@sgomez found what you mean and made the changes. Kinda saw similar results to what you said, as depending on the model it might tend to use the …
Thanks @matrushka! Sorry, but I am on holiday and cannot check this as fast as I would like. There is some dead code now, but nothing to worry about. I will remove it before the next release.
The existing implementation expects function-call arguments as strings (instead of objects), and the roundtrip for feeding the result back requires an updated message structure.
Also updated the @ai/provider dependencies and adjusted the tests to make sure everything is up-to-date.
Tested with Mistral and was able to get it working well.
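To illustrate both points, a hedged sketch of that roundtrip, modeled on the `messages.json` testdata referenced earlier in the thread (the values here are illustrative):

```ts
// The assistant turn carries tool_calls whose arguments are an object
// (the old implementation expected a pre-serialized string), and the
// tool result is fed back as a string in a role:"tool" message.
const roundtrip = [
  { role: "user", content: "What is the weather today in Paris?" },
  {
    role: "assistant",
    content: "",
    tool_calls: [
      {
        function: {
          name: "get_current_weather",
          arguments: { format: "celsius", location: "Paris" },
        },
      },
    ],
  },
  { role: "tool", content: "22" }, // tool output serialized to a string
];
```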