Conversation

@hzywhite (Collaborator) commented Sep 1, 2025

Overview

This document outlines the key differences between the current working branch and the main branch, focusing on the integration of RAGAnything functionality into the LightRAG server.

Operating Procedure (Message on September 16th, 2025)

1. Install RAG-Anything

git clone https://github.com/HKUDS/RAG-Anything.git
cd RAG-Anything
pip install -e ".[all]"

2. Install the RAGAnything branch of LightRAG

git clone -b RAGAnything https://github.com/HKUDS/LightRAG.git
cd LightRAG
pip install -e ".[api]"

3. Add a .env file to the LightRAG directory and start the server

cd LightRAG
lightrag-server

Modified Files

lightrag/api/lightrag_server.py

New Imports

  • RAGManager: Added import for RAGManager from lightrag.ragmanager
  • RAGAnything: Added import for RAGAnything and RAGAnythingConfig from raganything

Key Changes

1. Enhanced LightRAG Initialization
rag = LightRAG(
    working_dir=args.working_dir,
    workspace=args.workspace,
    input_dir=args.input_dir,  # New parameter added
    # ... existing parameters
)

Change: Added input_dir parameter to the LightRAG initialization.

2. RAGAnything Configuration Setup
config = RAGAnythingConfig(
    working_dir=args.working_dir or "./rag_storage",
    parser="mineru",  # Parser selection: mineru or docling
    parse_method="auto",  # Parse method: auto, ocr, or txt
    enable_image_processing=True,
    enable_table_processing=True,
    enable_equation_processing=True,
)

Purpose: Configures RAGAnything with comprehensive document processing capabilities including:

  • Parser Options: Support for mineru or docling parsers
  • Parse Methods: Automatic, OCR, or text-based parsing
  • Processing Features: Image, table, and equation processing enabled
3. LLM Model Function Definition
def llm_model_func(prompt, system_prompt=None, history_messages=None, **kwargs):
    # Avoid a mutable default argument; fall back to an empty history
    return openai_complete_if_cache(
        "gpt-4o-mini",
        prompt,
        system_prompt=system_prompt,
        history_messages=history_messages or [],
        api_key=api_key,
        base_url=base_url,
        **kwargs,
    )

Feature: Standardized LLM interaction using GPT-4o-mini with caching support.

4. Vision Model Function for Image Processing
def vision_model_func(
    prompt, system_prompt=None, history_messages=None, image_data=None, **kwargs
):
    if image_data:
        # Build the message list explicitly so no None entries reach the API
        messages = []
        if system_prompt:
            messages.append({"role": "system", "content": system_prompt})
        messages.append(
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {
                        "type": "image_url",
                        "image_url": {
                            "url": f"data:image/jpeg;base64,{image_data}"
                        },
                    },
                ],
            }
        )
        return openai_complete_if_cache(
            "gpt-4o",
            "",
            system_prompt=None,
            history_messages=[],
            messages=messages,
            api_key=api_key,
            base_url=base_url,
            **kwargs,
        )
    else:
        return llm_model_func(prompt, system_prompt, history_messages or [], **kwargs)

Capability: Enhanced vision processing using GPT-4o for image analysis with base64 encoding support.
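For context, image_data is expected to be a base64 string. A minimal helper for producing one from a file on disk might look like this (the helper name is illustrative, not part of the PR):

```python
import base64

def encode_image_to_base64(path: str) -> str:
    """Read an image file and return its base64-encoded string,
    suitable for the image_data argument of vision_model_func."""
    with open(path, "rb") as f:
        return base64.b64encode(f.read()).decode("utf-8")
```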

5. Embedding Function Configuration
embedding_func = EmbeddingFunc(
    embedding_dim=3072,
    max_token_size=8192,
    func=lambda texts: openai_embed(
        texts,
        model="text-embedding-3-large",
        api_key=api_key,
        base_url=base_url,
    ),
)

Specifications:

  • Embedding Dimension: 3072
  • Max Token Size: 8192
  • Model: text-embedding-3-large
6. RAGAnything Initialization
rag_anything = RAGAnything(
    lightrag=rag,
    config=config,
    llm_model_func=llm_model_func,
    vision_model_func=vision_model_func,
    embedding_func=embedding_func,
)
logger.info("Verifying RAGAnything parser installation")
rag_anything.verify_parser_installation_once()

RAGManager.set_rag(rag_anything)

Integration:

  • Combines LightRAG with RAGAnything capabilities
  • Verifies parser installation
  • Registers with RAGManager for centralized access
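The centralized-access pattern behind RAGManager.set_rag can be sketched as a simple class-level registry (an assumed shape for illustration; the actual lightrag.ragmanager implementation may differ):

```python
class RAGManager:
    """Minimal sketch of a registry-style manager (assumed interface)."""
    _rag = None

    @classmethod
    def set_rag(cls, rag):
        # Register the shared RAGAnything instance at server startup
        cls._rag = rag

    @classmethod
    def get_rag(cls):
        # Route handlers fetch the instance without it being passed around
        if cls._rag is None:
            raise RuntimeError("RAG instance has not been initialized")
        return cls._rag
```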
7. Updated Route Creation
app.include_router(
    create_document_routes(
        rag,
        rag_anything,  # New parameter added
        doc_manager,
        api_key,
    )
)

Enhancement: Document routes now receive both rag and rag_anything instances for comprehensive document processing.

Summary of New Capabilities

Enhanced Document Processing

  • Multi-format Support: Handles various document formats through advanced parsers
  • Visual Content Processing: Processes images, tables, and equations within documents
  • Flexible Parsing: Supports automatic, OCR, and text-based parsing methods

Improved AI Integration

  • Dual Model Support: Separate functions for text and vision processing
  • Advanced Embeddings: High-dimensional embeddings for better semantic understanding
  • Caching Optimization: Built-in caching for improved performance

Architecture Improvements

  • Centralized Management: RAGManager provides unified access to RAG capabilities
  • Modular Design: Clear separation between LightRAG and RAGAnything functionalities
  • Enhanced API: Document routes now support extended processing capabilities

Configuration Requirements

Environment Variables

  • LLM_BINDING_API_KEY: API key for LLM services
  • LLM_BINDING_HOST: Base URL for LLM services
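
A minimal .env sketch covering these variables (values are placeholders):

```shell
# Placeholder values; point these at your own LLM provider
LLM_BINDING_API_KEY=sk-your-key-here
LLM_BINDING_HOST=https://api.openai.com/v1
```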

Dependencies

  • raganything: New dependency for enhanced document processing
  • Parser dependencies (mineru/docling) for document parsing

Notes

  • The integration maintains backward compatibility with existing LightRAG functionality
  • New features are additive and don't break existing workflows
  • Parser verification ensures proper setup before operation

@machester4

@hzywhite Hi, I'm interested in using this feature. Are you ready to merge?

@apkdmg commented Sep 10, 2025

Hi, is there any reason why this PR has not been merged yet? All the tests passed and it looks fine.

@x-0D commented Sep 13, 2025

I found LightRAG via the RAG-Anything repo, which lists it as a ready-to-use RAG solution built on RAGAnything. I had found RAGAnything while searching for a ready-to-use RAG with the MinerU backend for PDF processing, and it is very disappointing that LightRAG does not support RAGAnything out of the box. This MR is a must-have for LightRAG; just add a new environment variable to choose which backend to use.


ERROR:  RAGAnything initialization failed: 'RAGAnything' object has no attribute 'verify_parser_installation_once'

FIX:

pip install "lightrag-hku[api] @ git+https://github.com/HKUDS/LightRAG.git@RAGAnything"

# Install upstream raganything after lightrag.
pip install "raganything[all] @ git+https://github.com/HKUDS/RAG-Anything.git"

ERROR /documents/paginated HTTP/1.1 500

INFO: 127.0.0.1:53198 - "POST /documents/paginated HTTP/1.1" 500
ERROR: Error getting paginated documents: 1 validation error for DocStatusResponse
scheme_name
  Input should be a valid string [type=string_type, input_value=None, input_type=NoneType]
    For further information visit https://errors.pydantic.dev/2.11/v/string_type
ERROR: Traceback (most recent call last):
  File "/Users/appleroot/projects/RANY/.venv/lib/python3.13/site-packages/lightrag/api/routers/document_routes.py", line 2935, in get_documents_paginated
    DocStatusResponse(
    ~~~~~~~~~~~~~~~~~^
        id=doc_id,
        ^^^^^^^^^^
    ...<11 lines>...
        multimodal_content=doc.multimodal_content,
        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    )
    ^
  File "/Users/appleroot/projects/RANY/.venv/lib/python3.13/site-packages/pydantic/main.py", line 253, in __init__
    validated_self = self.__pydantic_validator__.validate_python(data, self_instance=self)
pydantic_core._pydantic_core.ValidationError: 1 validation error for DocStatusResponse
scheme_name
  Input should be a valid string [type=string_type, input_value=None, input_type=NoneType]
    For further information visit https://errors.pydantic.dev/2.11/v/string_type

FIX:

# DELETE YOUR PREVIOUS RAG DATA
rm -rf inputs/ lightrag.log  rag_storage/
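
An alternative, more forgiving fix (assuming scheme_name is genuinely absent for documents ingested before this branch) would be to make the field optional on the response model. A hypothetical excerpt:

```python
from typing import Optional
from pydantic import BaseModel

class DocStatusResponse(BaseModel):
    """Hypothetical excerpt: only the fields relevant to the error are shown."""
    id: str
    # None for documents ingested before the RAGAnything branch added this field
    scheme_name: Optional[str] = None
```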

ERROR:

IN UI:
422 Unprocessable Content {"detail":[{"type":"missing","loc":["body","schemeId"],"msg":"Field required","input":null}]} /documents/upload

IN BACKEND:
INFO: 127.0.0.1:53353 - "POST /documents/upload HTTP/1.1" 422

FIX: Restart the browser, or right-click the Reload Page button -> Clear Cache and Hard Reload


ERROR:root:Error in parse_pdf: MineruParser._run_mineru_command() got an unexpected keyword argument 'parser'

FIX: HKUDS/RAG-Anything#113

pip install "mineru[core] @ git+https://github.com/opendatalab/MinerU.git"
pip install "raganything[all] @ git+https://github.com/HKUDS/RAG-Anything.git@ui"

So, here are the commands I ran for a clean install of LightRAG with MinerU support.

mkdir my-rag && cd my-rag

# Create python venv in new folder
python3 -m venv .venv
. .venv/bin/activate

# Install correct combination of packages
pip install "lightrag-hku[api] @ git+https://github.com/HKUDS/LightRAG.git@RAGAnything"
pip install "mineru[core] @ git+https://github.com/opendatalab/MinerU.git"
pip install "raganything[all] @ git+https://github.com/HKUDS/RAG-Anything.git@ui"

# save env.example file
wget "https://raw.githubusercontent.com/HKUDS/LightRAG/refs/heads/RAGAnything/env.example"

# copy and edit .env file
cp env.example .env
# nano .env

# launch server
lightrag-server

@7frank commented Sep 21, 2025

@hzywhite I am also waiting for this to be merged, but I'm curious about lightrag/api/webui/assets: what's with all these compiled assets? They don't look intentionally committed; at least IMO they shouldn't be there.

@danielaskdd (Collaborator)

This is a highly anticipated feature, and I’ll be able to dedicate time to researching and testing it only after addressing my current tasks. Please resolve the conflicts with the main branch first. Thank you.

@danielaskdd (Collaborator)

@hzywhite I am also waiting for this to be merged, but I'm curious about lightrag/api/webui/assets: what's with all these compiled assets? They don't look intentionally committed; at least IMO they shouldn't be there.

This is to clone the repository and run the server without rebuilding the frontend project.

@7frank commented Oct 9, 2025

@hzywhite I am also waiting for this to be merged, but I'm curious about lightrag/api/webui/assets: what's with all these compiled assets? They don't look intentionally committed; at least IMO they shouldn't be there.

This is to clone the repository and run the server without rebuilding the frontend project.

@danielaskdd
Thanks for the reply. Just to clarify, does the CI pipeline verify or rebuild these compiled assets to prevent the possibility of malicious code being injected through a PR by a bad-faith contributor?

@danielaskdd (Collaborator)

@hzywhite I am also waiting for this to be merged, but I'm curious about lightrag/api/webui/assets: what's with all these compiled assets? They don't look intentionally committed; at least IMO they shouldn't be there.

This is to clone the repository and run the server without rebuilding the frontend project.

@danielaskdd Thanks for the reply. Just to clarify, does the CI pipeline verify or rebuild these compiled assets to prevent the possibility of malicious code being injected through a PR by a bad-faith contributor?

CI-generated frontend build output cannot be directly added to the repository, correct? Are you suggesting that the CI pipeline should build the frontend assets and push them to PyPI instead? I don't have experience with this process; could you please share your insights?

@7frank commented Oct 13, 2025

@danielaskdd Thanks for the reply. Just to clarify, does the CI pipeline verify or rebuild these compiled assets to prevent the possibility of malicious code being injected through a PR by a bad-faith contributor?

The CI pipeline-generated frontend build code cannot be directly added to the repository, correct? Are you suggesting that the CI pipeline should build the frontend assets and push them to PyPI instead? I don't have experience with this process—could you please share your insights?

To keep this thread focused, I opened a new issue.

Review comment on the diff hunk:

raganything_error_message = None

try:
    api_key = get_env_value("LLM_BINDING_API_KEY", "", str)

@7frank commented Oct 14, 2025

I tested locally and I think this will override api_key = os.getenv("LIGHTRAG_API_KEY") or args.key and pass the wrong api_key to create_document_routes

@sicarius97

@7frank I also tested this locally and I think you are incorrect. The API key does not get overridden; that edit is intentional, so that RAGAnything uses the same LLM binding as LightRAG. My only comment would be that queries don't utilize the VLM-enhanced query from RAGAnything, and there should probably be two additional query routes for RAGAnything queries. Other than that, this works great for me locally, following the simple instructions included in the PR.

The lack of a merge is pretty inconvenient for the time being. RAGAnything enhances LightRAG 1000x.

@7frank commented Oct 29, 2025

@sicarius97

@7frank I also tested this locally and I think you are incorrect. The API key does not get overridden; that edit is intentional, so that RAGAnything uses the same LLM binding as LightRAG. My only comment would be that queries don't utilize the VLM-enhanced query from RAGAnything, and there should probably be two additional query routes for RAGAnything queries. Other than that, this works great for me locally, following the simple instructions included in the PR.

Lacking a merge on these is pretty inconvenient for the time being. Raganything enhances lightrag 1000x

Possible. Did you try accessing LightRAG via the API or through the web frontend?

In my case, I wanted to access the LightRAG API using a simple MCP. Before the merge, this worked without authentication. After the merge, all routes suddenly required an API key that was never set: LIGHTRAG_API_KEY.

My experience was as follows:

Routes such as the documents route now use an API key for authentication. See here:

app.include_router(
    create_document_routes(
        rag,
        rag_anything,
        doc_manager,
        api_key,
    )
)

The API key they use is LIGHTRAG_API_KEY, defined here:

api_key = os.getenv("LIGHTRAG_API_KEY") or args.key

However, in the PR, it was overridden by this line:

api_key = get_env_value("LLM_BINDING_API_KEY", "", str)

As a result, when using LightRAG via the API (not the web frontend), it now returns 401 errors because the API requires a key, even though no LIGHTRAG_API_KEY is set.

I renamed the variable for the local scope, and the error disappeared.
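
The rename described above amounts to keeping the two keys in separate variables so the LLM binding key never shadows the key guarding the HTTP routes. A sketch under those assumptions (resolve_keys is an illustrative name, not the PR's code):

```python
import os

def resolve_keys(args_key=None):
    """Keep the server auth key and the LLM provider key separate."""
    # Key that protects the LightRAG HTTP routes (may legitimately be unset)
    api_key = os.getenv("LIGHTRAG_API_KEY") or args_key
    # Key sent to the LLM provider; must not overwrite api_key
    llm_api_key = os.getenv("LLM_BINDING_API_KEY", "")
    return api_key, llm_api_key
```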

@liwu96 commented Nov 19, 2025

Really looking forward to the merge of RAGAnything into the LightRAG server.

@defaultdigital1

Is it possible to help here to get this done? Really looking forward to it.
