[Enhancement] [Build-Time V1] Reduce Memory Footprint during Native Index Creation

## Description
While running some experiments I can am able to see that we need more memory than final graph which gets created in a segment.

**Based on some research what I can see for Faiss**

1. We give faiss, all the vectors and ids for which graph needs to be created. here: https://github.com/navneet1v/k-NN/blob/badbb1d3438a37536ce6de29c4085cded0ef2723/jni/src/faiss_wrapper.cpp#L159 .

2. After this faiss, internally calls add on the index function. here: https://github.com/facebookresearch/faiss/blob/8898eabe9f16e7bbebdc19667c993f7dc55a6a0c/faiss/IndexIDMap.cpp#L80

3. This will call internally add function of IndexHNSW, which will [first store](https://github.com/facebookresearch/faiss/blob/f884ba240ea7ce948ce850c31f82bb2e9eac21b0/faiss/IndexHNSW.cpp#L395) the vector in a [flat index](https://github.com/facebookresearch/faiss/blob/2d8886cd4fb1bd0ebe5f8de7cae4c1ad91c8a40d/faiss/IndexFlatCodes.cpp#L23-L31)(which is nothing but a binary representation of vectors) and then [start creating](https://github.com/facebookresearch/faiss/blob/f884ba240ea7ce948ce850c31f82bb2e9eac21b0/faiss/IndexHNSW.cpp#L398) the HNSW graph.

So based on the above flow I can see that vectors which are stored in native memory in step 1 and passed to Faiss are stored first in FlatIndex of Faiss doubling the native memory.


**For Nmslib:**
1. We first take the vectors and copy vectors into [this new object](https://github.com/opensearch-project/k-NN/blob/feature/stream-vectors/jni/src/nmslib_wrapper.cpp#L140-L153) of nmslib.
2. After that, we [create the index for nmslib](https://github.com/opensearch-project/k-NN/blob/feature/stream-vectors/jni/src/nmslib_wrapper.cpp#L158). 
3. After the index is created we [save the index to a file](https://github.com/opensearch-project/k-NN/blob/feature/stream-vectors/jni/src/nmslib_wrapper.cpp#L159). This index writing makes a very large allocation before it writes the index on the file here: https://github.com/nmslib/nmslib/blob/a2d6624e1315402662025debfdd614b505d9c3ef/similarity_search/src/method/hnsw.cc#L417-L428


## Issue
Due to this, we need more memory to create a graph than the output Faiss/Nmslib index at a segment level. This really throw off the memory estimations while force merging the segments. 

## Resolution
With [Streaming Vectors from Java to JNI Layer](https://github.com/opensearch-project/k-NN/issues/1506), we are already sending a limited amount of vectors from Java to JNI layer, we can use the same approach to build the graph incrementally by not storing the vectors in a memory location but directly calling faiss [add_with_ids api](https://github.com/facebookresearch/faiss/blob/8898eabe9f16e7bbebdc19667c993f7dc55a6a0c/faiss/IndexIDMap.cpp#L76-L84). This will ensure that vectors which are getting streamed from Java to JNI layer we are creating faiss index for them, so no extra memory(apart from streamed vectors) will be required. The api is available for hnsw, IVF, IVFPQ and HNSWPQ.

For Nmslib, we can start creating the object which is required to create the index of nmslib.  This should at-least avoid increase memory foot print. @jmazanec15 added the resolution here: https://github.com/opensearch-project/k-NN/issues/1600#issuecomment-2045973766


~~### Limitation
K-NN also provides nmslib as KNNEngine, but based on my deep-dive I didn't find similar feature or Nmslib. But overall I still believe we should implement this feature for Faiss Engine.~~






Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Enhancement] [Build-Time V1] Reduce Memory Footprint during Native Index Creation #1600

Description

Issue

Resolution

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Enhancement] [Build-Time V1] Reduce Memory Footprint during Native Index Creation #1600

Description

Description

Issue

Resolution

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions