feat: lambda support for DSM #622

michael-zhao459 · 2025-06-20T14:18:25Z

What does this PR do?

This PR adds lambda support for Data Streams Monitoring (DSM) and reworks the original implementation.

DSM context is passed through the trace propagation headers, code is refactored to use existing extraction logic (deleted dsm.py, reinventing the wheel here).
If DSM is enabled, add custom DSM logic to the extracted context afterwards

Motivation

Remove redundant code. DSM customers wanted to have Lambda support, currently context is not propagated correctly with lambdas.

Testing Guidelines

Wrote unit tests on the functions I added and ensured past tests did not break. Tested on AWS Sandbox accounts with all forms of the pipeline

Additional Notes

IMPORTANT NOTE: This PR cannot get merged until DataDog/dd-trace-py#13646 gets released in the tracer. Once a version of the tracer is released with this change, will update the pyproject.toml file

Types of Changes

Bug fix
New feature
Breaking change
Misc (docs, refactoring, dependency upgrade, etc.)

Check all that apply

This PR's description is comprehensive
This PR contains breaking changes that are documented in the description
This PR introduces new APIs or parameters that are documented and unlikely to change in the foreseeable future
This PR impacts documentation, and it has been updated (or a ticket has been logged)
This PR's changes are covered by the automated tests
This PR collects user input/sensitive content into Datadog
This PR passes the integration tests (ask a Datadog member to run the tests)

datadog_lambda/tracing.py

tests/test_dsm.py

datadog_lambda/tracing.py

datadog_lambda/wrapper.py

datadog_lambda/tracing.py

michael-zhao459 · 2025-06-25T12:39:25Z

datadog_lambda/tracing.py

+                if config.data_streams_enabled:
+                    from ddtrace.data_streams import PROPAGATION_KEY_BASE_64
+
+                    data_streams_ctx = {


I know the else is redundant but datadog gets mad if i just do the if too many indents

piochelepiotr · 2025-06-25T12:42:11Z

datadog_lambda/tracing.py

    except Exception as e:
        logger.debug("The trace extractor returned with error %s", e)
-        return extract_context_from_lambda_context(lambda_context)
+        return extract_context_from_lambda_context(lambda_context), None


we should not return None here

joeyzhao2018 · 2025-06-26T11:47:45Z

datadog_lambda/tracing.py

@@ -265,15 +265,27 @@ def extract_context_from_sqs_or_sns_event_or_context(event, lambda_context):
            if dd_json_data:
                dd_data = json.loads(dd_json_data)

+                data_streams_ctx = {}
+                if config.data_streams_enabled:
+                    from ddtrace.data_streams import PROPAGATION_KEY_BASE_64


My main concerns are

Creating dictionary objects and bound methods for every invocation is inefficient.

It is very hard to follow the logic and hard to maintain and may introduce unexpected behaviors that are hard to debug in the future.

May I suggest the following alternative implementation? Let me know what do you think.

def _create_dsm_carrier_func(dd_data): """Create a carrier function for DSM context extraction.""" def carrier_get(key): return dd_data.get(key) if dd_data else None return carrier_get # then in In the extraction functions: if config.data_streams_enabled: dsm_carrier = _create_dsm_carrier_func(dd_data) # Pass the original dd_data else: dsm_carrier = None

I agree with the justifications you made for this change will change the code now!

datadog_lambda/tracing.py

joeyzhao2018

LGTM

joeyzhao2018

LGTM

datadog_lambda/wrapper.py

…n None instead of {}

piochelepiotr · 2025-07-02T16:55:03Z

datadog_lambda/tracing.py

@@ -373,10 +421,15 @@ def extract_context_from_kinesis_event(event, lambda_context):
            data_obj = json.loads(data_str)
            dd_ctx = data_obj.get("_datadog")


when would that be set for Kinesis?

https://github.com/DataDog/dd-trace-py/blob/926d8383af8e71c6a83494adf85918a4ad7cf920/ddtrace/internal/datastreams/botocore.py#L230 Where we inject DSM context for Kinesis produce call

datadog_lambda/tracing.py

michael-zhao459 · 2025-07-02T17:03:29Z

datadog_lambda/tracing.py

-                return propagator.extract(dd_data)
+                context = propagator.extract(dd_data)
+                # Do not want to set checkpoint with "" arn
+                if arn:


A "" arn would end up with a queue with no name, causing many collisions and overall bad behavior. Only set a checkpoint if arn has a non empty string

datadog_lambda/tracing.py

datadog-datadog-prod-us1 bot reviewed Jun 20, 2025

View reviewed changes

datadog_lambda/tracing.py Outdated Show resolved Hide resolved

purple4reina reviewed Jun 20, 2025

View reviewed changes