Commit f1efb7e

Adding llama instead of e5 to notebook (#445)
## Problem

Describe the purpose of this change. What problem is being solved and why?

Replace multilingual-e5 with llama per Tej's request.

## Solution

Describe the approach you took. Link to any relevant bugs, issues, docs, or other resources.

## Type of Change

- [ ] Bug fix (non-breaking change which fixes an issue)
- [x] New feature (non-breaking change which adds functionality)
- [ ] Breaking change (fix or feature that would cause existing functionality to not work as expected)
- [ ] This change requires a documentation update
- [ ] Infrastructure change (CI configs, etc)
- [ ] Non-code change (docs, etc)
- [ ] None of the above: (explain here)

## Test Plan

Describe specific steps for validating this change.

1 parent 0115524 commit f1efb7e

File tree

1 file changed: +48 −34 lines

docs/semantic-search.ipynb

48 additions & 34 deletions
```diff
@@ -146,8 +146,9 @@
     "Integrated Inference allows you to specify the creation of a Pinecone index with a specific Pinecone-hosted embedding model, which makes it easy to interact with the index. To learn more about Integrated Inference, including what other models are available, take a [look here](https://docs.pinecone.io/guides/get-started/overview#integrated-embedding).\n",
     "\n",
     "\n",
-    "Here, we specify a starter tier index with the [multilingual-e5-large](https://docs.pinecone.io/models/multilingual-e5-large) embedding model. We also specify a mapping for what field in our\n",
-    "records we will embed with this model. Then, we grab the index we just created for embedding later."
+    "Here, we specify a starter tier index with the [llama-text-embed-v2](https://docs.pinecone.io/models/llama-text-embed-v2) embedding model. We also specify a mapping for what field in our records we will embed with this model. Then, we grab the index we just created for embedding later.\n",
+    "\n",
+    "Want to embed a subset with multiple languages instead? Use the [multilingual-e5-large model](https://docs.pinecone.io/models/multilingual-e5-large) and simply specify it in place of the previous model when creating an index."
    ]
   },
   {
```
```diff
@@ -167,8 +168,8 @@
     "{'dimension': 1024,\n",
     " 'index_fullness': 0.0,\n",
     " 'metric': 'cosine',\n",
-    " 'namespaces': {'english-sentences': {'vector_count': 416}},\n",
-    " 'total_vector_count': 416,\n",
+    " 'namespaces': {},\n",
+    " 'total_vector_count': 0,\n",
     " 'vector_type': 'dense'}"
    ]
   },
```
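The next hunk swaps the embedding model in the index-creation cell. For orientation, here is a minimal sketch of what the full cell plausibly looks like after the change, assuming the Pinecone Python SDK's `create_index_for_model` helper; the index name and the `has_index` guard are assumptions, not shown in this diff:

```python
from pinecone import Pinecone

pc = Pinecone(api_key="YOUR_API_KEY")

index_name = "semantic-search"  # assumed name; the diff doesn't show it
if not pc.has_index(index_name):
    pc.create_index_for_model(
        name=index_name,
        cloud="aws",
        region="us-east-1",
        embed={
            # Swap in "multilingual-e5-large" here to embed non-English data
            "model": "llama-text-embed-v2",
            "field_map": {"text": "chunk_text"},
        },
    )

# Grab the index handle; a freshly created index reports empty namespaces
# and total_vector_count == 0, matching the stats hunk above
index = pc.Index(index_name)
print(index.describe_index_stats())
```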
```diff
@@ -187,7 +188,9 @@
     " cloud=\"aws\",\n",
     " region=\"us-east-1\",\n",
     " embed={\n",
-    " \"model\":\"multilingual-e5-large\",\n",
+    " # Use this if you instead want to embed non-English or a multilingual subset of the data\n",
+    " #\"model\":\"multilingual-e5-large\",\n",
+    " \"model\": \"llama-text-embed-v2\",\n",
     " \"field_map\":{\"text\": \"chunk_text\"}\n",
     " }\n",
     " )\n",
```
```diff
@@ -241,7 +244,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 13,
+   "execution_count": 6,
    "metadata": {},
    "outputs": [
    {
```
```diff
@@ -255,7 +258,7 @@
     " {'en': 'I have to go to sleep.', 'es': 'Tengo que irme a dormir.'}]}"
    ]
   },
-   "execution_count": 13,
+   "execution_count": 6,
    "metadata": {},
    "output_type": "execute_result"
   }
```
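The next two hunks touch the cell that filters the translation pairs by keyword, flattens the nested translation column, and keeps only the English side. Here is a self-contained sketch of that pattern with the Hugging Face `datasets` library (toy data stands in for the notebook's dataset, whose loading call isn't part of this diff):

```python
from datasets import Dataset

# Toy stand-in for the notebook's English/Spanish translation pairs
translation_pairs = Dataset.from_dict({
    "translation": [
        {"en": "I go to the park.", "es": "Voy al parque."},
        {"en": "I have to go to sleep.", "es": "Tengo que irme a dormir."},
    ]
})

keywords = ["park"]

# Keep only pairs whose English side mentions a keyword
# (the "Filter" progress bar in the new cell output comes from this step)
translation_pairs = translation_pairs.filter(
    lambda row: any(k in row["translation"]["en"].lower() for k in keywords)
)

# Flatten the nested column into "translation.en" / "translation.es", then shuffle
translation_pairs = translation_pairs.flatten()
translation_pairs = translation_pairs.shuffle(seed=1)

# Keep the English side; repeat with "es" (and an e5 index) for Spanish
english_sentences = (
    translation_pairs
    .rename_column("translation.en", "text")
    .remove_columns("translation.es")
)
print(english_sentences[0])
```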
```diff
@@ -266,9 +269,18 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 6,
+   "execution_count": 7,
    "metadata": {},
-   "outputs": [],
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "Filter: 100%|██████████| 214127/214127 [00:00<00:00, 439387.27 examples/s]\n",
+      "Flattening the indices: 100%|██████████| 416/416 [00:00<00:00, 237004.95 examples/s]\n"
+     ]
+    }
+   ],
    "source": [
     "keywords= [\"park\"]\n",
     "\n",
```
```diff
@@ -295,6 +307,8 @@
     " translation_pairs = translation_pairs.flatten()\n",
     " translation_pairs = translation_pairs.shuffle(seed=1)\n",
     "\n",
+    " # If you want to include the Spanish subset, simply repeat the steps below with \"es\" instead of \"en\"\n",
+    " # Be sure to create your index with multilingual-e5-large as well in this case!\n",
     " english_sentences = translation_pairs.rename_column(\"translation.en\", \"text\").remove_columns(\"translation.es\")\n",
     "\n",
     " # add lang column to indicate embedding origin\n",
```
```diff
@@ -346,7 +360,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 7,
+   "execution_count": 8,
    "metadata": {
     "colab": {
      "base_uri": "https://localhost:8080/",
@@ -373,7 +387,7 @@
    "name": "stderr",
    "output_type": "stream",
    "text": [
-    "Upserting records batch: 100%|██████████| 5/5 [00:04<00:00, 1.18it/s]\n"
+    "Upserting records batch: 100%|██████████| 5/5 [00:02<00:00, 1.79it/s]\n"
    ]
   }
  ],
```
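The tqdm bar in this hunk comes from a batched upsert into the integrated index. A hedged sketch of that loop, assuming `index` is the handle from the creation step, `upsert_records` (the SDK's method for integrated-embedding indexes), and a batch size of 96, which is an assumption; the actual size isn't in the diff, though 5 batches over 416 records implies something in that range:

```python
from tqdm import tqdm

# Toy records shaped for the field_map above: Pinecone embeds "chunk_text" server-side
sentences = ["I go to the park.", "Let's take a walk in the park."]
records = [
    {"_id": str(i), "chunk_text": s, "lang": "en"}
    for i, s in enumerate(sentences)
]

batch_size = 96  # assumed batch size
for start in tqdm(range(0, len(records), batch_size), desc="Upserting records batch"):
    index.upsert_records("english-sentences", records[start : start + batch_size])
```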
```diff
@@ -420,32 +434,32 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 8,
+   "execution_count": 12,
    "metadata": {},
    "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
-     "Sentence: I have the afternoon off today, so I plan to go to the park, sit under a tree and read a book. Semantic Similarity Score: 0.8618775606155396\n",
+     "Sentence: I have the afternoon off today, so I plan to go to the park, sit under a tree and read a book. Semantic Similarity Score: 0.4675264060497284\n",
      "\n",
-     "Sentence: Let's go to the park where it's not noisy. Semantic Similarity Score: 0.8588659167289734\n",
+     "Sentence: I went to the park to play tennis. Semantic Similarity Score: 0.4330753684043884\n",
      "\n",
-     "Sentence: Let's go to the park where it is not noisy. Semantic Similarity Score: 0.8588587045669556\n",
+     "Sentence: I go to the park. Semantic Similarity Score: 0.4261631369590759\n",
      "\n",
-     "Sentence: Let's go to the park where it isn't noisy. Semantic Similarity Score: 0.858812153339386\n",
+     "Sentence: I went to the park yesterday. Semantic Similarity Score: 0.42239895462989807\n",
      "\n",
-     "Sentence: I go to the park. Semantic Similarity Score: 0.858041524887085\n",
+     "Sentence: I went to the park last Sunday. Semantic Similarity Score: 0.42069774866104126\n",
      "\n",
-     "Sentence: I'll go to the park. Semantic Similarity Score: 0.8502914905548096\n",
+     "Sentence: I like going for a walk in the park. Semantic Similarity Score: 0.41970351338386536\n",
      "\n",
-     "Sentence: I like going for a walk in the park. Semantic Similarity Score: 0.847651481628418\n",
+     "Sentence: I went to the park last Saturday. Semantic Similarity Score: 0.4103226661682129\n",
      "\n",
-     "Sentence: Let's take a walk in the park. Semantic Similarity Score: 0.8399631977081299\n",
+     "Sentence: I need light plates because today my family is going to eat lunch in the park. Semantic Similarity Score: 0.40211308002471924\n",
      "\n",
-     "Sentence: Who wants to go to the park? Semantic Similarity Score: 0.8391842842102051\n",
+     "Sentence: Linda went to the park to listen to music. Semantic Similarity Score: 0.4012303650379181\n",
      "\n",
-     "Sentence: Do you like to walk in the park? Semantic Similarity Score: 0.8343247771263123\n",
+     "Sentence: I'll go to the park. Semantic Similarity Score: 0.3996794819831848\n",
      "\n"
     ]
    }
```
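The wholesale change in scores here is expected: cosine similarities are not comparable across embedding models, so swapping e5 for llama-text-embed-v2 shifts the absolute scale even though the hits stay on-topic. Below is a hedged sketch of the query loop that prints this output, assuming the SDK's `search` method for integrated indexes and its `result`/`hits` response shape (both from memory, not confirmed by the diff):

```python
# Query with raw text; the index embeds it with llama-text-embed-v2 server-side
query = "a relaxing afternoon at the park"  # hypothetical; the notebook's query isn't in this diff
results = index.search(
    namespace="english-sentences",
    query={"inputs": {"text": query}, "top_k": 10},
)

# Response shape assumed: each hit carries the record's fields and a "_score"
for hit in results["result"]["hits"]:
    print(f"Sentence: {hit['fields']['chunk_text']} Semantic Similarity Score: {hit['_score']}\n")
```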
```diff
@@ -476,32 +490,32 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 9,
+   "execution_count": 13,
    "metadata": {},
    "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
-     "Sentence: Where can I park? Semantic Similarity Score: 0.8843114376068115\n",
+     "Sentence: I can't find a spot to park my spaceship. Semantic Similarity Score: 0.44190075993537903\n",
      "\n",
-     "Sentence: Where can I park? Semantic Similarity Score: 0.8841626048088074\n",
+     "Sentence: I can't find a spot to park my spaceship. Semantic Similarity Score: 0.44190075993537903\n",
      "\n",
-     "Sentence: Where can we park? Semantic Similarity Score: 0.8696897625923157\n",
+     "Sentence: There isn't anywhere else to park. Semantic Similarity Score: 0.4017431437969208\n",
      "\n",
-     "Sentence: Where can I park my car? Semantic Similarity Score: 0.8663355112075806\n",
+     "Sentence: I have to park my car here. Semantic Similarity Score: 0.3978813886642456\n",
      "\n",
-     "Sentence: May I park here for a while? Semantic Similarity Score: 0.864980161190033\n",
+     "Sentence: Where can I park? Semantic Similarity Score: 0.39125218987464905\n",
      "\n",
-     "Sentence: I'm double-parked. Could you hurry it up? Semantic Similarity Score: 0.8629273176193237\n",
+     "Sentence: Where can I park? Semantic Similarity Score: 0.39125218987464905\n",
      "\n",
-     "Sentence: I'm double-parked. Could you hurry it up? Semantic Similarity Score: 0.8629273176193237\n",
+     "Sentence: I am parking my car near the office. Semantic Similarity Score: 0.37668246030807495\n",
      "\n",
-     "Sentence: I'm double-parked. Could you hurry it up? Semantic Similarity Score: 0.8626684546470642\n",
+     "Sentence: May I park here for a while? Semantic Similarity Score: 0.3707844614982605\n",
      "\n",
-     "Sentence: I'm double-parked. Could you hurry it up? Semantic Similarity Score: 0.8626684546470642\n",
+     "Sentence: I parked on the left side of the street just in front of the school. Semantic Similarity Score: 0.37002164125442505\n",
      "\n",
-     "Sentence: \"May I park here?\" \"No, you can't.\" Semantic Similarity Score: 0.8602052927017212\n",
+     "Sentence: Where can I park my car? Semantic Similarity Score: 0.3609045743942261\n",
      "\n"
     ]
    }
```
```diff
@@ -549,7 +563,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 10,
+   "execution_count": 11,
    "metadata": {
     "id": "-cWdeKzhAtww"
    },
```
