
Implement support to BatchAPIs to gather evidence #687


Closed · wants to merge 32 commits

Conversation

Collaborator

@maykcaldas maykcaldas commented Nov 14, 2024

This PR implements support for sending requests to the OpenAI and Anthropic batch APIs. Because gathering evidence and summarizing all candidate papers is inherently parallel, we plan to use the batch API when possible.

Use of the batch API is gated by Settings.use_batch_in_summary, so paperqa workflows are unchanged when this setting is left at its default of False. Currently, using a batch keeps the process busy-waiting until the batch finishes on the LLM provider's side, which can take up to 24 hours. This scaling issue will be addressed in a follow-up PR.
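For illustration, here is a minimal sketch of that gate, reusing the gather_with_batch and gather_with_concurrency names that come up later in this review; the argument lists are assumptions:

```python
# Hedged sketch, not the PR's actual code: route summarization through a single
# provider-side batch job when enabled, otherwise use the existing concurrent
# path. The argument lists below are assumptions for illustration.
if settings.use_batch_in_summary:
    results = await gather_with_batch(matches, question, summary_llm)
else:
    results = await gather_with_concurrency(max_concurrent, summary_coros)
```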

Task list

  • Create a class to make batch calls to the OpenAI batch API (a usage sketch follows this list)
  • Create a class to make batch calls to the Anthropic batch API
  • Integrate the OpenAI class into the get_evidence method
  • Integrate the Anthropic class into the get_evidence method
  • Update get_summary_llm to decide which provider to use given the llm in the config
  • ❌ Use pytest.mark.vcr in the tests to avoid creating batches for every test
  • Implement mock servers for testing purposes
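
As a rough illustration of what the OpenAI-side class wraps, here is a self-contained sketch of the OpenAI Batch API flow (not the PR's actual class; the model name and prompts are placeholders): write one JSONL line per request, upload the file, create a batch with a 24-hour completion window, poll until a terminal status, then read the output file.

```python
import io
import json
import time

from openai import OpenAI  # installed via the optional dependency group this PR adds

client = OpenAI()

# One JSONL line per summarization request; custom_id lets results be matched
# back to their source papers later.
lines = [
    json.dumps({
        "custom_id": f"evidence-{i}",
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "gpt-4o-mini",  # placeholder model
            "messages": [{"role": "user", "content": prompt}],
        },
    })
    for i, prompt in enumerate(["Summarize paper A ...", "Summarize paper B ..."])
]

batch_file = client.files.create(
    file=io.BytesIO("\n".join(lines).encode()), purpose="batch"
)
batch = client.batches.create(
    input_file_id=batch_file.id,
    endpoint="/v1/chat/completions",
    completion_window="24h",
)

# Poll until the batch reaches a terminal status. This is the busy-waiting the
# PR description mentions; a follow-up PR was meant to address it.
while batch.status not in {"completed", "failed", "expired", "cancelled"}:
    time.sleep(30)
    batch = client.batches.retrieve(batch.id)

if batch.status == "completed":
    output_jsonl = client.files.content(batch.output_file_id).text
```

The Anthropic path is analogous through its Message Batches API, except the requests list is submitted directly in the create call rather than via an uploaded file.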

This class is used to submit batch calls to the OpenAI batch API
@maykcaldas maykcaldas self-assigned this Nov 14, 2024
maykcaldas and others added 9 commits November 15, 2024 09:10
Also added a dependency group in pyproject.toml to install openai and anthropic only if the user wants to use batches, refactored the logic for summarizing evidence in batch, and moved the code to core.py
Also fixed bugs in tests and created Enums to avoid hardcoding the batch status identifiers
The time limit and the polling interval for the batches are now in the Settings
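For concreteness, a hedged sketch of how these knobs might sit on a pydantic Settings model; only use_batch_in_summary is named in this PR, so the other two field names are hypothetical stand-ins for the time limit and polling interval:

```python
from pydantic import BaseModel, Field


class Settings(BaseModel):
    # Named in the PR description:
    use_batch_in_summary: bool = Field(
        default=False,
        description="Route evidence summarization through provider batch APIs.",
    )
    # Hypothetical field names for the commit's time limit and polling interval:
    batch_summary_timelimit: int = Field(
        default=24 * 60 * 60,
        description="Seconds to wait for a batch before giving up.",
    )
    batch_polling_interval: int = Field(
        default=30,
        description="Seconds to sleep between batch status polls.",
    )
```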
@maykcaldas maykcaldas marked this pull request as ready for review November 19, 2024 17:55
@dosubot dosubot bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Nov 19, 2024
@dosubot dosubot bot added the enhancement New feature or request label Nov 19, 2024
@maykcaldas maykcaldas requested a review from whitead November 19, 2024 17:56
@maykcaldas maykcaldas changed the title from "[WIP] Implement support to BatchAPIs to gather evidence" to "Implement support to BatchAPIs to gather evidence" Nov 19, 2024

```diff
 for _, llm_result in results:
     session.add_tokens(llm_result)

-session.contexts += [r for r, _ in results if r is not None]
+session.contexts += [r for r, _ in results]
```
Collaborator
why did we cut the r is not None filter here? I would think that the results from gather_with_concurrency could still be None on failure, but maybe I'm wrong

Collaborator Author
This gets the Contexts from gather_with_concurrency or gather_with_batch, and both always return a list of (Context, LLMResult) tuples. What can happen is an empty Context.text, but it seems to me that r is always an instance of Context.
Also, I didn't see any case of map_fxn_summary returning None while studying the code, and mypy flags the r is not None check as redundant because r can never be None.

Maybe that's an edge case that I didn't see?

Collaborator

If we correctly type hinted gather_with_concurrency then this would be resolved. @maykcaldas can you adjust it to be this?

```python
from collections.abc import Awaitable, Iterable
from typing import TypeVar

T = TypeVar("T")


async def gather_with_concurrency(n: int, coros: Iterable[Awaitable[T]]) -> list[T]:
    ...
```
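
For context, here is a common way to implement a helper satisfying the suggested signature; a minimal sketch assuming paperqa's version follows the usual semaphore pattern, not a quote of its actual code:

```python
import asyncio
from collections.abc import Awaitable, Iterable
from typing import TypeVar

T = TypeVar("T")


async def gather_with_concurrency(n: int, coros: Iterable[Awaitable[T]]) -> list[T]:
    # A semaphore caps how many coroutines run at once; results come back
    # in input order.
    semaphore = asyncio.Semaphore(n)

    async def with_sem(coro: Awaitable[T]) -> T:
        async with semaphore:
            return await coro

    return await asyncio.gather(*(with_sem(c) for c in coros))
```

With return_exceptions left at its default of False, a failed coroutine raises instead of yielding None, which is consistent with dropping the is not None filter above.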

@maykcaldas maykcaldas requested a review from mskarlin November 19, 2024 19:46


Collaborator

@whitead whitead commented Mar 8, 2025

Closing for now - probably no longer relevant.

@whitead whitead closed this Mar 8, 2025
Labels
enhancement New feature or request · size:XXL This PR changes 1000+ lines, ignoring generated files.

4 participants