Skip to content

feat: support merge_insert with blob encoding column#7615

Open
nyl3532016 wants to merge 1 commit into
lance-format:mainfrom
nyl3532016:main
Open

feat: support merge_insert with blob encoding column#7615
nyl3532016 wants to merge 1 commit into
lance-format:mainfrom
nyl3532016:main

Conversation

@nyl3532016

Copy link
Copy Markdown
Contributor

merge_insert directly invokes session_ctx.read_lance_unordered without specifying Blob column handling, which defaults to BlobsDescriptions. Consequently, DataFusion retrieves Blob data as a Struct type description when scanning the original dataset, causing a crash with "LargeBinary vs Struct schema mismatch!" because the table's logical schema expects LargeBinary.
Interestingly, this only happens if the input data for merge_insert lacks Blob columns; if Blob columns are present in the input, column pruning is triggered during the original data read, bypassing the Blob columns and masking the bug.

@github-actions github-actions Bot added the enhancement New feature or request label Jul 3, 2026
@codecov

codecov Bot commented Jul 4, 2026

Copy link
Copy Markdown

Codecov Report

❌ Patch coverage is 91.00000% with 9 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
rust/lance/src/dataset/write/merge_insert.rs 90.52% 9 Missing ⚠️

📢 Thoughts on this report? Let us know!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant