[install-help]: Responses Not Based on the Specified Directory or File Content #94

aef5748 · 2024-10-30T08:51:19Z

When asking questions and specifying a directory or file for reference, the responses may sometimes include content that is not found within the specified directory or file. This is particularly likely to occur when a directory is chosen as the basis for the inquiry.

Is there any specific setting or way of asking questions that would allow for accurately finding content based on the inquiry and the specified document?

kyteinsky · 2024-10-30T12:10:47Z

Hi, how did you confirm this?
Do you see files from other directories when using "Selective context" and specifying a directory in the assistant?

aef5748 · 2024-11-01T09:19:23Z

I selected a specific folder (the one boxed in green in the screenshot, about 42 files) and asked questions about the collection.

However, the reply is meaningless, and the files boxed in red are not part of the data I specified.

kyteinsky · 2024-11-05T23:47:08Z

There seem to be two issues:

1. Weird answer (what does it say at the top of the answer though?)
   This could be due to the model's ability to handle certain types of data. Try reformatting the "Meeting minutes.ods" file (in a new file, to preserve the original) to get a better answer.
2. Files from outside the specified scope
   Did you reinstall nextcloud during testing by any chance? File IDs may be mixed up with those from an index of the previous Nextcloud install. Try completely removing context chat (including its data, which is in a Docker volume; use this command: docker volume rm nc_app_context_chat_backend_data) and reinstalling. For faster indexing of one user, you can use this command: occ context_chat:scan <user_id>
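The reset described above could be sketched as a small dry-run script. Only the two commands quoted in this thread are used; the `run` helper, the `RUN` switch, and the `OCC` and `NC_USER` variables are assumptions made for illustration, and the way `occ` is invoked will differ between installations.

```shell
#!/bin/sh
# Sketch only: prints the commands unless RUN=1 is set.
# OCC and NC_USER are hypothetical placeholders; adjust them to your setup.
OCC="${OCC:-php occ}"
NC_USER="${NC_USER:-<user_id>}"

# Print the command in dry-run mode, execute it when RUN=1.
run() {
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    else
        printf '%s\n' "$*"
    fi
}

reset_context_chat_index() {
    # 1. Remove the backend's data volume (command from this thread).
    #    Docker refuses to delete a volume that a container still uses,
    #    so the backend has to be removed before this step.
    run docker volume rm nc_app_context_chat_backend_data
    # 2. After reinstalling the apps, re-index a single user.
    #    $OCC is left unquoted on purpose so "php occ" splits into words.
    run $OCC context_chat:scan "$NC_USER"
}
```

Calling `reset_context_chat_index` without `RUN=1` only prints what would be executed, which makes it easy to review the commands before deleting any data.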

aef5748 · 2024-11-07T03:15:44Z

> Weird answer (what does it say at the top of the answer though?)

The top answer is as follows

If I change other models, maybe it can be solved?

> Files from outside the specified scope
> Did you reinstall nextcloud during testing by any chance?

Nextcloud was not reinstalled during the test.
I upgraded from Nextcloud 26 to Nextcloud 30, and then installed context_chat.

I am currently trying to delete the files in persistent_storage/vector_db_data and reinstall context_chat, but nothing seems to send the files to context_chat (I have executed php occ background-job:worker -v -t 60 "OCA\ContextChat\BackgroundJobs\IndexerJob")

Executing occ context_chat:scan <user_id> will pop up the following error (it works normally after repeating it a few times)

From the log, I can see that there are messages about receiving data (which will keep recurring), but I don’t see any tasks for uploading files to Nextcloud.
context_chat_backend_error_logs_20241107.txt
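Since the scan reportedly succeeds after a few repeats, a generic retry wrapper is one way to automate that. This is a minimal sketch: the `retry` function, its arguments, and the attempt count and delay in the example are all assumptions, and retrying only works around the error without explaining it.

```shell
# retry MAX DELAY CMD...: run CMD until it succeeds, at most MAX times,
# sleeping DELAY seconds between attempts.
retry() {
    max="$1"; delay="$2"; shift 2
    attempt=1
    while :; do
        if "$@"; then
            return 0
        fi
        if [ "$attempt" -ge "$max" ]; then
            echo "giving up after $attempt attempts: $*" >&2
            return 1
        fi
        echo "attempt $attempt failed, retrying in ${delay}s" >&2
        attempt=$((attempt + 1))
        sleep "$delay"
    done
}

# Example (user ID left as a placeholder):
#   retry 5 10 php occ context_chat:scan <user_id>
```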

github-actions · 2024-12-07T07:20:13Z

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

kyteinsky · 2024-12-11T19:55:00Z

hi, sorry for the late reply.

> Executing occ context_chat:scan <user_id> will pop up the following error

this is the way to do that.

From the logs it looks like a big file. What are the sizes of your test files? It seems like there was a request timeout by your proxy server since the file transfer took too long.
It is not recommended to use that large files; in the future we will either limit the max file size (configurable) or make the indexing async.
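To check the file-size hypothesis, one could list the files in a folder that exceed some threshold before indexing it. The sketch below assumes direct filesystem access to the user's files; the `list_large_files` name and the 10 MiB default are arbitrary choices for illustration, not limits defined by context_chat.

```shell
# list_large_files DIR [MIN_KIB]: print files in DIR larger than MIN_KIB KiB.
list_large_files() {
    dir="$1"
    min_kib="${2:-10240}"            # 10 MiB, an arbitrary illustrative default
    min_bytes=$((min_kib * 1024))
    # "-size +Nc" matches files strictly larger than N bytes.
    find "$dir" -type f -size +"${min_bytes}c" -print
}

# Example (path left as a placeholder):
#   list_large_files /path/to/user/files 10240
```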

kyteinsky · 2024-12-11T20:00:50Z

> If I change other models, maybe it can be solved?

Maybe. You could also change the sampler settings (temperature, top_p, etc.), or try different formats for the same data, rearranging it so it is easier for the LLM to understand. This can't be tuned directly, since you don't know exactly how the parser will parse the ods file, so a text-like document with the data would probably work better.
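One way to produce such a text-like copy is LibreOffice's headless converter, which writes a new file and leaves the original spreadsheet untouched. This is a sketch under the assumption that LibreOffice is installed and reachable as `soffice`; the `ods_to_csv` function and the `RUN` and `SOFFICE` variables are hypothetical. As far as I know, a plain CSV export covers a single sheet, so multi-sheet files may need more care.

```shell
# ods_to_csv FILE OUTDIR: write a CSV copy of FILE into OUTDIR.
# Prints the command unless RUN=1 is set; the source file is not modified.
ods_to_csv() {
    src="$1"; outdir="$2"
    cmd="${SOFFICE:-soffice}"
    if [ "${RUN:-0}" = "1" ]; then
        "$cmd" --headless --convert-to csv --outdir "$outdir" "$src"
    else
        printf '%s --headless --convert-to csv --outdir %s %s\n' "$cmd" "$outdir" "$src"
    fi
}

# Example:
#   RUN=1 ods_to_csv "Meeting minutes.ods" converted
```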

aef5748 · 2024-12-16T09:17:02Z

> From the logs it looks like a big file. What are the sizes of your test files?

They are all default files created with the account, with a maximum size of about 14.3 MB.
I have rerun it several times, and the error does not necessarily occur on large files. Sometimes it also appears on files of a few hundred KB.

kyteinsky · 2024-12-19T18:05:06Z

I'm not entirely sure what happened, but I guess the collectives folder was a bit too large to be processed synchronously when used in selective context (it wasn't indexed before). So I would suggest first indexing the collectives folder with occ context_chat:scan -d 'Collectives' <user_id> and then trying the same thing again.
Also, please use the latest version of context_chat and context_chat_backend for this.
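If several folders are going to be used with selective context, the pre-indexing step could be looped over them. The sketch below relies only on the `-d` option shown in this comment; the `scan_dirs` function and the `OCC` and `RUN` variables are assumptions for illustration.

```shell
# Hypothetical placeholder for how occ is invoked on your system.
OCC="${OCC:-php occ}"

# scan_dirs USER DIR...: index each DIR for USER, one scan per directory.
# Prints the commands unless RUN=1 is set.
scan_dirs() {
    user="$1"; shift
    for dir in "$@"; do
        if [ "${RUN:-0}" = "1" ]; then
            # $OCC is unquoted on purpose so "php occ" splits into words.
            $OCC context_chat:scan -d "$dir" "$user" || return 1
        else
            printf "%s context_chat:scan -d '%s' %s\n" "$OCC" "$dir" "$user"
        fi
    done
}

# Example (user ID left as a placeholder):
#   scan_dirs <user_id> Collectives
```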

DaphneMuller · 2025-01-03T17:23:52Z

> it is not recommended to use that large files

maybe document that in the ai docs too!

aef5748 added the help wanted label Oct 30, 2024

github-actions bot added the stale label Dec 7, 2024

kyteinsky removed the stale label Dec 11, 2024
