Fix attention size computation error in OpenVINO backend for LLM by zhaixuejun1993 · Pull Request #131 · ravi9/llama.cpp

zhaixuejun1993 · 2026-04-13T14:56:36Z

This pull request introduces several updates to the OpenVINO GGML decoder logic, primarily improving the detection and handling of attention-related operations and key-value cache identification. The changes enhance robustness when parsing computational graphs and ensure that certain preprocessing steps are only applied in appropriate contexts.

Improvements to attention and KV cache handling:

Improved the logic in compute_llm_params to validate nested source tensors before dereferencing them when handling GGML_OP_SOFT_MAX, preventing potential null pointer dereferences.
Added new logic to extract attention_size from specific GGML_OP_MUL_MAT graph patterns involving permute and view operations, increasing accuracy in parameter computation for attention mechanisms.
Refactored the is_kvcache static method to prioritize buffer usage checks, making the order of conditions more robust and consistent.
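The first two items above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `Node`, `Op`, `mask_width_from_softmax`, and `attention_size_from_mul_mat` are hypothetical stand-ins for the backend's tensor type and helpers, and which source slot and which dimension carry the attention size are assumptions made only for the example.

```cpp
#include <array>
#include <cstdint>
#include <optional>

// Hypothetical stand-in for a graph node; the real decoder walks ggml tensors.
enum class Op { None, SoftMax, MulMat, Permute, View };

struct Node {
    Op op = Op::None;
    std::array<int64_t, 4> ne{1, 1, 1, 1};             // dimensions
    std::array<const Node*, 2> src{nullptr, nullptr};  // source nodes
};

// Guarded traversal for a SoftMax node: every level of the source chain is
// checked before it is dereferenced, so a missing mask or a mask without a
// parent yields "no value" instead of a null pointer dereference.
// Assumption: the mask is src[1] and the size of interest is ne[0] of its source.
std::optional<int64_t> mask_width_from_softmax(const Node* n) {
    if (n == nullptr || n->op != Op::SoftMax) return std::nullopt;
    const Node* mask = n->src[1];
    if (mask == nullptr) return std::nullopt;
    const Node* inner = mask->src[0];
    if (inner == nullptr) return std::nullopt;
    return inner->ne[0];
}

// Pattern match for MulMat(Permute(View(...)), ...).
// Assumption: the attention size is read from ne[1] of the view.
std::optional<int64_t> attention_size_from_mul_mat(const Node* n) {
    if (n == nullptr || n->op != Op::MulMat) return std::nullopt;
    const Node* permute = n->src[0];
    if (permute == nullptr || permute->op != Op::Permute) return std::nullopt;
    const Node* view = permute->src[0];
    if (view == nullptr || view->op != Op::View) return std::nullopt;
    return view->ne[1];
}
```

Returning `std::optional` lets the caller distinguish "pattern not present" from a real size, so an unmatched graph shape leaves any previously computed value untouched.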

Preprocessing logic improvements:

Updated the preprocess function to only add sliced masks when the decoder is stateful, preventing unnecessary operations for stateless models.
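The gating described above might look like the sketch below. The names `DecoderInfo` and `plan_preprocess_steps` and the step label are hypothetical and exist only to show the condition; the actual preprocess function operates on the OpenVINO model rather than on a list of strings.

```cpp
#include <string>
#include <vector>

// Hypothetical summary of the decoder state relevant to preprocessing.
struct DecoderInfo {
    bool stateful = false;
};

// Build the list of preprocessing steps; the sliced-mask step is only
// scheduled for stateful decoders, so stateless models skip it entirely.
std::vector<std::string> plan_preprocess_steps(const DecoderInfo& decoder) {
    std::vector<std::string> steps;
    if (decoder.stateful) {
        steps.push_back("add_sliced_mask");
    }
    return steps;
}
```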

Additional information

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure:

zhaixuejun1993 requested review from cavusmustafa and wine99 as code owners April 13, 2026 14:56

github-actions bot added OpenVINO ggml labels Apr 13, 2026

OpenVINO backend: fix error for attention size compute in llm param

71ece5f

wine99 merged commit 36daf24 into ravi9:dev_backend_openvino Apr 13, 2026
1 check passed

wine99 merged 1 commit into ravi9:dev_backend_openvino from zhaixuejun1993:xuejun/fix-error-sttention-size
