[bugfix] fix case for agent when memory bank registered without specifying provider_id #264

yanxi0830 · 2024-10-17T23:21:30Z

Context

agent cold start may have cases to register memory_banks without specifying provider_id.

llama-stack/llama_stack/providers/impls/meta_reference/agents/agent_instance.py

Lines 642 to 648 in 9fcf5d5

    
           if session_info.memory_bank_id is None: 
        
               bank_id = f"memory_bank_{session_id}" 
        
               memory_bank = VectorMemoryBankDef( 
        
                   identifier=bank_id, 
        
                   embedding_model="all-MiniLM-L6-v2", 
        
                   chunk_size_in_tokens=512, 
        
               )

Before Fix

Calling register_memory_bank w/o provider_id fail

response = await client.register_memory_bank(
    VectorMemoryBankDef(
        identifier="test_bank2",
        embedding_model="all-MiniLM-L6-v2",
        chunk_size_in_tokens=512,
        overlap_size_in_tokens=64,
    )
)

After Fix

Unit Test

ashwinb

We could add a test case in providers/tests/agents/test_agents.py for this?

yanxi0830 · 2024-10-18T00:27:48Z

We could add a test case in providers/tests/agents/test_agents.py for this?

done!

…fying provider_id (#264) * fix case where memory bank is registered without provider_id * memory test * agents unit test

* docker compose ollama * comment * update compose file * readme for distributions * readme * move distribution folders * move distribution/templates to distributions/ * rename * kill distribution/templates * readme * readme * build/developer cookbook/new api provider * developer cookbook * readme * readme * [bugfix] fix case for agent when memory bank registered without specifying provider_id (#264) * fix case where memory bank is registered without provider_id * memory test * agents unit test * Add an option to not use elastic agents for meta-reference inference (#269) * Allow overridding checkpoint_dir via config * Small rename * Make all methods `async def` again; add completion() for meta-reference (#270) PR #201 had made several changes while trying to fix issues with getting the stream=False branches of inference and agents API working. As part of this, it made a change which was slightly gratuitous. Namely, making chat_completion() and brethren "def" instead of "async def". The rationale was that this allowed the user (within llama-stack) of this to use it as: ``` async for chunk in api.chat_completion(params) ``` However, it causes unnecessary confusion for several folks. Given that clients (e.g., llama-stack-apps) anyway use the SDK methods (which are completely isolated) this choice was not ideal. Let's revert back so the call now looks like: ``` async for chunk in await api.chat_completion(params) ``` Bonus: Added a completion() implementation for the meta-reference provider. Technically should have been another PR :) * Improve an important error message * update ollama for llama-guard3 * Add vLLM inference provider for OpenAI compatible vLLM server (#178) This PR adds vLLM inference provider for OpenAI compatible vLLM server. * Create .readthedocs.yaml Trying out readthedocs * Update event_logger.py (#275) spelling error * vllm * build templates * delete templates * tmp add back build to avoid merge conflicts * vllm * vllm --------- Co-authored-by: Ashwin Bharambe <[email protected]> Co-authored-by: Ashwin Bharambe <[email protected]> Co-authored-by: Yuan Tang <[email protected]> Co-authored-by: raghotham <[email protected]> Co-authored-by: nehal-a2z <[email protected]>

fix case where memory bank is registered without provider_id

f0600a3

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 17, 2024

yanxi0830 changed the title ~~[bugfix] fix memory bank is registered without provider_id~~ [bugfix] fix case for agent when memory bank registered without specifying provider_id Oct 17, 2024

yanxi0830 marked this pull request as ready for review October 17, 2024 23:22

yanxi0830 requested review from ashwinb, hardikjshah, dltn and raghotham as code owners October 17, 2024 23:22

ashwinb approved these changes Oct 17, 2024

View reviewed changes

yanxi0830 added 2 commits October 17, 2024 16:43

memory test

4667c1f

agents unit test

6d7f341

yanxi0830 merged commit be3c5c0 into main Oct 18, 2024
4 checks passed

yanxi0830 deleted the bugfix_agent_memory branch October 18, 2024 00:28

yanxi0830 added a commit that referenced this pull request Oct 21, 2024

[bugfix] fix case for agent when memory bank registered without speci…

2f5c410

…fying provider_id (#264) * fix case where memory bank is registered without provider_id * memory test * agents unit test

yanxi0830 mentioned this pull request Oct 21, 2024

attachment_behavior not working for accessing remote files meta-llama/llama-stack-apps#95

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix] fix case for agent when memory bank registered without specifying provider_id #264

[bugfix] fix case for agent when memory bank registered without specifying provider_id #264

yanxi0830 commented Oct 17, 2024 •

edited

Loading

ashwinb left a comment

yanxi0830 commented Oct 18, 2024

	if session_info.memory_bank_id is None:
	bank_id = f"memory_bank_{session_id}"
	memory_bank = VectorMemoryBankDef(
	identifier=bank_id,
	embedding_model="all-MiniLM-L6-v2",
	chunk_size_in_tokens=512,
	)

[bugfix] fix case for agent when memory bank registered without specifying provider_id #264

[bugfix] fix case for agent when memory bank registered without specifying provider_id #264

Conversation

yanxi0830 commented Oct 17, 2024 • edited Loading

Context

Before Fix

After Fix

Unit Test

ashwinb left a comment

Choose a reason for hiding this comment

yanxi0830 commented Oct 18, 2024

yanxi0830 commented Oct 17, 2024 •

edited

Loading