[LLM] [NPU] StaticLLMPipeline: Import model if blob file is present in Stateful pipeline #1494

smirnov-alexey · 2025-01-07T16:29:26Z

Depends on openvinotoolkit/openvino#27915
Depends on openvinotoolkit/openvino#28420

dmatveev · 2025-01-13T13:35:13Z

@TolyaTalamanov @AsyaPronina please review

src/cpp/src/llm_pipeline_static.cpp

TolyaTalamanov

Here is how the blob logic is done for StatelessLLMPipeline:

1. The user provides the USE_BLOBS option to signal the intention to use a precompiled blob:

   openvino.genai/src/cpp/src/llm_pipeline_static.cpp

   Line 962 in 2517a0b

   const auto use_blobs = pop_or_default(properties, "USE_BLOBS", false);

2. Along with this, the user may also provide the BLOB_PATH option, which points to the folder where the blobs are located.
3. Depending on the model type (prefill or generation), the pipeline searches for openvino_prefill.blob and openvino_generate.blob.

Here is a proposal for how it can be done in your case:

- Keep (1) and (2) the same as in StatelessLLMPipeline.
- For (3), the blob name can simply be openvino_model.blob.

This will allow the user to:

- Specify the exact blob location if it differs from the folder provided
- Switch between the blob and model cases
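A minimal sketch of the proposal above, assuming the blob is looked up as openvino_model.blob inside the BLOB_PATH folder and that the model directory is the fallback folder when BLOB_PATH is not given (that fallback is an assumption, not something stated in the review). `Properties`, `Value`, and `resolve_blob` are hypothetical names used so the example builds without OpenVINO; `pop_or_default` here is a stand-in that mimics the helper quoted from llm_pipeline_static.cpp, not its actual implementation.

```cpp
#include <cassert>
#include <filesystem>
#include <map>
#include <optional>
#include <string>
#include <variant>

// Hypothetical stand-in for the real property map, so no OpenVINO headers are needed.
using Value = std::variant<bool, std::string>;
using Properties = std::map<std::string, Value>;

// Stand-in for pop_or_default: return the value stored under `key` and erase
// the entry, or return `fallback` when the key is absent.
template <typename T>
T pop_or_default(Properties& props, const std::string& key, const T& fallback) {
    auto it = props.find(key);
    if (it == props.end()) {
        return fallback;
    }
    T value = std::get<T>(it->second);  // throws std::bad_variant_access on a type mismatch
    props.erase(it);
    return value;
}

// Returns the blob file to import, or std::nullopt when the model should be
// compiled from its original files instead.
std::optional<std::filesystem::path> resolve_blob(const std::filesystem::path& model_dir,
                                                  Properties& props) {
    assert(!model_dir.empty());
    const bool use_blobs = pop_or_default(props, "USE_BLOBS", false);
    // Pop BLOB_PATH unconditionally so it is never left among the remaining options.
    const std::string blob_dir = pop_or_default(props, "BLOB_PATH", std::string{});
    if (!use_blobs) {
        return std::nullopt;
    }
    // Assumption: fall back to the model directory when no folder is given.
    const std::filesystem::path folder =
        blob_dir.empty() ? model_dir : std::filesystem::path(blob_dir);
    return folder / "openvino_model.blob";
}
```

With this shape, switching between the blob and model cases is a matter of toggling USE_BLOBS, and both keys are consumed before the remaining options are used for anything else. String values should be stored as `std::string` explicitly, since a bare string literal may select the `bool` alternative of the variant on some compilers.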

TolyaTalamanov · 2025-01-14T09:57:15Z

src/cpp/src/llm_pipeline_static.cpp

-    auto compiled = setupAndCompileModel(model, model_desc, properties);
-    m_request = compiled->create_infer_request();
-    m_sampler.set_seed(m_generation_config.rng_seed);
+    const auto use_blobs = pop_or_default(properties, "USE_BLOBS", false);


We only have 1 blob, so shouldn't this be USE_BLOB?

Import model if blob file is present

2517a0b

smirnov-alexey added this to the 2025.0 milestone Jan 7, 2025

smirnov-alexey requested review from dmatveev, AsyaPronina and TolyaTalamanov January 7, 2025 16:29

smirnov-alexey assigned dmatveev Jan 7, 2025

github-actions bot added the category: LLM label Jan 7, 2025

ilya-lavrenov added the category: NPU label Jan 8, 2025

smirnov-alexey changed the title ~~[DO NOT MERGE] StaticLLMPipeline: Import model if blob file is present in Stateful pipeline~~ StaticLLMPipeline: Import model if blob file is present in Stateful pipeline Jan 8, 2025

ilya-lavrenov changed the title ~~StaticLLMPipeline: Import model if blob file is present in Stateful pipeline~~ [LLM] [NPU] StaticLLMPipeline: Import model if blob file is present in Stateful pipeline Jan 9, 2025

dmatveev assigned TolyaTalamanov and unassigned dmatveev Jan 13, 2025

TolyaTalamanov reviewed Jan 13, 2025

TolyaTalamanov reviewed Jan 13, 2025

dmatveev added the Code Freeze label Jan 13, 2025

Address review comments

743c47e

smirnov-alexey requested a review from ilya-lavrenov January 13, 2025 20:57

Fix build

3fed330

ilya-lavrenov reviewed Jan 14, 2025

smirnov-alexey added 2 commits January 14, 2025 07:45

Address review comment

f14ec05

Merge branch 'master' of https://github.com/openvinotoolkit/openvino.…

8d89a89

…genai into as/npuw_import

TolyaTalamanov enabled auto-merge January 14, 2025 09:56

TolyaTalamanov reviewed Jan 14, 2025

Update llm_pipeline_static.cpp

0022d95

TolyaTalamanov self-requested a review January 14, 2025 10:02

TolyaTalamanov approved these changes Jan 14, 2025

Merge branch 'master' into as/npuw_import

f91bcd1

smirnov-alexey disabled auto-merge January 14, 2025 15:57

Get properties

41dc2fb

Merge branch 'as/npuw_import' of https://github.com/smirnov-alexey/op…

d5a3d4b

…envino.genai into as/npuw_import

smirnov-alexey enabled auto-merge January 14, 2025 18:31

smirnov-alexey added this pull request to the merge queue Jan 14, 2025

Merged via the queue into openvinotoolkit:master with commit 2e5c2a1 Jan 14, 2025
59 checks passed
