Context Search With Documents (Personalization) - Revised #1254

vicilliar · 2025-06-26T05:20:33Z

Updated version of personalization PR intended for 2.21.0 which was reverted due to an issue with the Vespa client.
Differences from old personalization PR:
-Use persistent SSL context instead of persistent transport, because it caused hanging during concurrent requests. Performance gains are comparable.
-Integration test added for concurrent get documents requests to ensure no hanging

Change Summary

Supports new key documents in context parameter of search. This allows for a dict of doc_id:weight pairs to be provided by user for vectors to be extracted and interpolated for tensor search or hybrid search with tensor retrieval or ranking.

Updated example context object:

    {
        "tensor": [{"vector": [1 for i in range(768)], "weight": 1}], 
        "documents": {
            "ids": {"doc1": 1},
            "parameters": {
                "excludeInputDocuments": False,
                "tensorFields": ["text_field_1"],
                "concurrency": 10
            }
        }
    }

-Interpolation is now done on all vectors (query, context.tensor, context.document) instead of previous behavior of weighted average.
-Absolute value is now used when summing vector weights in interpolation (This makes ZeroSumWeightsError obsolete, we now use AllZeroWeightsError instead)
-Interpolation logic implemented in numpy.
-Fixed some error messages for hybrid / pydantic model validation.

New Env Vars:
-MARQO_MAX_SEARCH_CONTEXT_DOCS - aximum number of documents that can be used as context documents in a search request.

The following optimizations have been implemented:
-Getting doc vectors fetches only essential embeddings fields from Vespa
-A persistent SSL Context is created upon vespa client initialization and is used for the verify param of every async client instantiation. This saves time spent creating new contexts.
-Index is fetched from cache instead of Vespa
-Fetched doc vectors are not formatted into document form with Pydantic. Vectors are just used directly

Related Jira Ticket

https://s2search.atlassian.net/jira/polaris/projects/MOSD/ideas/view/4863474?selectedIssue=MOSD-23&issueViewLayout=sidebar&issueViewSection=overview&atlOrigin=eyJpIjoiNzcyNDBkNzVjM2JhNDE0Y2I4ODUzNDY3YTc2MmE0ZmUiLCJwIjoiaiJ9

Checklist

Tests have been added for changes
Documentation has been updated
Breaking changes are clearly identified
Python client changes linked or N/A
py-marqo changes: Support for context documents in search (Personalization) py-marqo#287

For new field types:

Tests cover score modifier usage of this new type
Test indexes updated to cover the new type for all APIs (add docs, search, partial update, etc.)

vicilliar added 30 commits May 13, 2025 16:46

initial work on personalization

d209288

basic personalization implementation

3fdbb85

udpate data types

b9c6695

add new tests

609fa41

absolute value weights and add extra tests

2313a0f

debug personalization, add tests, fix error messages

978c86e

Merge branch 'mainline' into joshua/personalization-with-context

9e52f1b

remove todos

97b1d32

update broken tests

9d545d4

Merge branch 'mainline' into joshua/personalization-with-context

9b5f959

CustomVectorQuery fix

419c2e1

push version to 2.20

51fb062

fix semver version

08c619e

add new telemetry for context docs

533dcb3

draft field fetch optimization

80a5946

Merge branch 'mainline' into joshua/personalization-with-context

37c9b53

fine tune methods

1d77384

add concurrency control

f05c373

Remove pydantic parsing for embeddings fetch

c7774a7

persistent transport

b6224e4

validation for non-existent tensor fields

c85d06d

rename vespa get pool size env var

c893c7d

update env var name

6326136

add new env vars async pool size and concurrency limit

9f7f0dc

raise error when doc fetch is not successful

e8f0b2a

Merge branch 'mainline' into joshua/personalization-with-context

03dbad8

fix all li's comments

f3ce3b2

fix test import issue

2bd9ad7

fix unit tests, add legacy unstructured support

283e960

standardize unstructured legacy support and all zero weights errors

05227a6

vicilliar added 23 commits June 18, 2025 14:13

add env var tests

69506cd

Merge branch 'mainline' into joshua/personalization-with-context

dc48bcc

update telemetry key for speed test

efb6eb3

add unit tests

2dae4e4

sanitize unit tests

25ac32f

duplicate ID check

0affc78

update telemetry, block legacy unstr, add tests,

ec5b3ec

add unstructured blocking tests

671a148

integ test on legacy unstructured fixed

f9934ff

fix unit test error messages

11b6e14

add unit tests

93d2b9e

Merge branch 'mainline' into joshua/personalization-with-context

0c9ccc5

add id type validation, use conlist for tensors

df5e666

set max connections to unlimited

58b5886

hardcoded tensor field order

6476ce0

Merge branch 'mainline' into joshua/personalization-with-context

a5d08c0

use persistent ssl context instead of transport

f56198c

revert env vars

bf184f8

change vespa client tests for env vars

5e7c51b

moved tensor search tests to correct dir

450f251

update version

c720f94

Merge branch 'mainline' into joshua/personalization-with-context

58d1b95

fix version to 2.22.0

f27b45b

vicilliar requested review from farshidz and wanliAlex and removed request for farshidz July 2, 2025 06:13

Merge branch 'mainline' into joshua/personalization-with-context

63ef0b1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Context Search With Documents (Personalization) - Revised #1254

Context Search With Documents (Personalization) - Revised #1254

Uh oh!

vicilliar commented Jun 26, 2025 •

edited

Loading

Uh oh!

Uh oh!

Context Search With Documents (Personalization) - Revised #1254

Are you sure you want to change the base?

Context Search With Documents (Personalization) - Revised #1254

Uh oh!

Conversation

vicilliar commented Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Change Summary

Related Jira Ticket

Checklist

For new field types:

Uh oh!

Uh oh!

vicilliar commented Jun 26, 2025 •

edited

Loading