[MLOB-2522] Add `--llmobs` flag for instrumenting Lambdas with LLM Observability #1603

sabrenner · 2025-04-02T19:07:55Z

What and why?

This PR adds a --llmobs flag to the AWS Lambda instrument/uninstrument commands, which takes in a string. This sets

DD_LLMOBS_ENABLED="true"
DD_LLMOBS_AGENTLESS_ENABLED="false" (enabled as part of feat: add llmobs proxy paths to trace agent datadog-lambda-extension#628, our instructions will point users to make sure to install the extension layer)
DD_LLMOBS_ML_APP to the string provided

Some additional changes/explanations/clarifications

Our documentation will highlight intended usage as:

datadog-ci lambda instrument -f <YOUR_LAMBDA_FUNCTION_NAME> -r <AWS_REGION> -v 106 -e 73 --llmobs <YOUR_ML_APP>

i.e, this should be used to instrument one layer, with both the language layer and extension layer specified (layer versions will be auto-populated).

With both layers, these three variables are the only ones needed to enable LLM Observability. DD_LLMOBS_AGENTLESS_ENABLED="false" will use the extension layer's agent as a proxy, enabled by feat: add llmobs proxy paths to trace agent datadog-lambda-extension#628

How?

Adds an --llmobs option to read, and parses it onto the llmobsMlApp setting, which if it is set, sets DD_LLMOBS_ENABLED, DD_LLMOBS_AGENTLESS_ENABLED, and DD_LLMOBS_ML_APP. This is my first time contributing, so I did my best to add tests where I saw appropriate. Happy to add more where I should if I missed some spots!

Review checklist

Feature or bugfix MUST have appropriate tests (unit, integration)

datadog-datadog-prod-us1 · 2025-04-03T14:28:34Z

Datadog Report

Branch report: sabrenner/lambda-instrument-llmobs
Commit report: f9cfa24
Test service: datadog-ci-tests

✅ 0 Failed, 1272 Passed, 0 Skipped, 1m 46.23s Total duration (34.48s time saved)

duncanista · 2025-04-07T14:25:10Z

src/commands/lambda/README.md

@@ -106,6 +106,7 @@ You can pass the following arguments to `instrument` to specify its behavior. Th
 | `--upload-git-metadata`        | `-u`      | Whether to enable Git metadata uploading, as a part of source code integration. Git metadata uploading is only required if you don't have the Datadog Github Integration installed.                                                                                                                                                           | `true`  |
 | `--no-upload-git-metadata`     |           | Disables Git metadata uploading, as a part of source code integration. Use this flag if you have the Datadog Github Integration installed, as it renders Git metadata uploading unnecessary.                                                                                                                                                  |         |
 | `--apm-flush-deadline`         |           | Used to determine when to submit spans before a timeout occurs, in milliseconds. When the remaining time in an AWS Lambda invocation is less than the value set, the tracer attempts to submit the current active spans and all finished spans. Supported for NodeJS and Python. Defaults to `100` milliseconds.                              |         |
+| `--llmobs`                     |           | If specified, enables LLM Observability for the instrumented function(s) with the provided ML application name.                                                                                                                                                                                                                               |         |               


Just curious, what happens if you don't set the app name but enable LLMObs? as in, just the two env vars

"DD_LLMOBS_ENABLED": "true", "DD_LLMOBS_AGENTLESS_ENABLED": "false"

What would happen to the ML_APP name?

yeah good question. when I tried doing this in the tests, ie

const code = await cli.run( [ 'lambda', 'instrument', // ... '--llmobs', // 'my-ml-app', ], context )

i got a mismatched output of

Unknown Syntax Error: Command not found; did you mean one of: ... While running lambda instrument -f arn:aws:lambda:us-east-1:123456789012:function:lambda-hello-world --dry-run --layerVersion 10 --logLevel debug --service middletier --env staging --version 0.2 --extra-tags layer:api,team:intake --no-source-code-integration --llmobs

and assumed that when using Options.String('llmobs') expects a value or otherwise causes a failure somewhere. But, i'm not too well-versed on this behavior here.

but, in general, if ml_app is not set, the LLMObs SDKs will throw/raise at runtime.

Oh yeah, I didn't expected it to work in the CI command, just curious about what would happen in LLMObs in general, but

if ml_app is not set, the LLMObs SDKs will throw/raise at runtime.

answers my question.

I'm just curious because it's interesting to see a product that requires a pair of env vars to work properly, normally I'd just see the enabling and that's it – thanks!

sabrenner added 5 commits March 31, 2025 09:49

add llmobs enablement options for lambda

c3786d4

update README

ab87564

add to config

a8ddcc5

add tests

65ed377

update README

578d2e0

sabrenner added serverless Related to [cloud-run, lambda, stepfunctions] documentation Improvements or additions to documentation labels Apr 2, 2025

sabrenner added 3 commits April 2, 2025 16:09

fix README

346e445

force llmobs agentless enabled to false

573c6ad

add uninstrument tests

f9cfa24

sabrenner marked this pull request as ready for review April 7, 2025 14:18

sabrenner requested review from a team as code owners April 7, 2025 14:18

sabrenner requested a review from hannahqjiang April 7, 2025 14:18

duncanista reviewed Apr 7, 2025

View reviewed changes

michaelcretzman approved these changes Apr 7, 2025

View reviewed changes

TalUsvyatsky approved these changes Apr 9, 2025

View reviewed changes

duncanista approved these changes Apr 9, 2025

View reviewed changes

sabrenner merged commit fea1d86 into master Apr 9, 2025
15 checks passed

sabrenner deleted the sabrenner/lambda-instrument-llmobs branch April 9, 2025 18:07

mtalec mentioned this pull request Apr 15, 2025

Release v3.4.0 #1618

Merged

sabrenner mentioned this pull request Apr 16, 2025

[MLOB] revert forced agentless and api key fetching DataDog/datadog-lambda-python#585

Merged

11 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MLOB-2522] Add `--llmobs` flag for instrumenting Lambdas with LLM Observability #1603

[MLOB-2522] Add `--llmobs` flag for instrumenting Lambdas with LLM Observability #1603

Uh oh!

sabrenner commented Apr 2, 2025 •

edited

Loading

Uh oh!

datadog-datadog-prod-us1 bot commented Apr 3, 2025

Uh oh!

duncanista Apr 7, 2025

Uh oh!

sabrenner Apr 7, 2025

Uh oh!

duncanista Apr 7, 2025

Uh oh!

Uh oh!

Uh oh!

[MLOB-2522] Add --llmobs flag for instrumenting Lambdas with LLM Observability #1603

[MLOB-2522] Add --llmobs flag for instrumenting Lambdas with LLM Observability #1603

Uh oh!

Conversation

sabrenner commented Apr 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What and why?

How?

Review checklist

Uh oh!

datadog-datadog-prod-us1 bot commented Apr 3, 2025

Datadog Report

Uh oh!

duncanista Apr 7, 2025

Choose a reason for hiding this comment

Uh oh!

sabrenner Apr 7, 2025

Choose a reason for hiding this comment

Uh oh!

duncanista Apr 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

[MLOB-2522] Add `--llmobs` flag for instrumenting Lambdas with LLM Observability #1603

[MLOB-2522] Add `--llmobs` flag for instrumenting Lambdas with LLM Observability #1603

sabrenner commented Apr 2, 2025 •

edited

Loading