Skip to content

SSI Troubleshooting Updates #30840

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 4 commits into
base: master
Choose a base branch
from

Conversation

domalessi
Copy link
Contributor

@domalessi domalessi commented Jul 31, 2025

What does this PR do? What is the motivation?

https://datadoghq.atlassian.net/browse/DOCS-11673

Updates the SSI Troubleshooting guide:

  • Adds a section about using SSI troubleshooting features in Datadog UI
  • Restructures content for clearer breakout/organization of sections

Merge instructions

Merge readiness:

  • Ready for merge

For Datadog employees:

Your branch name MUST follow the <name>/<description> convention and include the forward slash (/). Without this format, your pull request will not pass CI, the GitLab pipeline will not run, and you won't get a branch preview. Getting a branch preview makes it easier for us to check any issues with your PR, such as broken links.

If your branch doesn't follow this format, rename it or create a new branch and PR.

[6/5/2025] Merge queue has been disabled on the documentation repo. If you have write access to the repo, the PR has been reviewed by a Documentation team member, and all of the required checks have passed, you can use the Squash and Merge button to merge the PR. If you don't have write access, or you need help, reach out in the #documentation channel in Slack.

Additional notes

@domalessi domalessi requested a review from a team as a code owner July 31, 2025 20:45
@domalessi domalessi added the WORK IN PROGRESS No review needed, it's a wip ;) label Jul 31, 2025
Copy link
Contributor

github-actions bot commented Jul 31, 2025

📝 Documentation Team Review Required

This pull request requires approval from the @DataDog/documentation team before it can be merged.

Please ensure your changes follow our documentation guidelines and wait for a team member to review and approve your changes.

Copy link
Contributor

@sarjyusuf
Copy link
Contributor

Do not merge until we get approval from me and @betterengineering

<li>Environments: Linux hosts, containers, Kubernetes</li>
<li>Datadog Agent v7.68.2+</li>
</ul>
<p>To enable it, contact support or turn on the <code>apm-ssi-troubleshooting</code> feature flag.<p>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this is public/customer facing right? I don't think users have the ability to do this.

Copy link
Contributor Author

@domalessi domalessi Aug 4, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@avivenzio-dd Should I just say "To enable it, contact support." ??


Start with the UI to spot issues across multiple hosts or services before investigating logs or containers directly.

To access instrumentation insights in Datadog:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is instrumentation insights how we are branding this feature?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

|-------|--------|--------|------------------|
| Other Agent already attached | Injection failed | Another tracer is already loaded in the process | Remove existing tracer and restart the process, or disable SSI for this service. |
| Unsupported runtime version | Injection failed | For example, Node.js 16.14.2 detected (minimum: 18.x) | Upgrade to a supported runtime version. |
| Permission denied | Injection failed | Insufficient privileges to inject into process | Ensure the Agent has sufficient permissions. Check container security policies and capabilities. |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

for permissions can we show the required permissions? or link to a page that has them?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comment on lines 38 to 39
- `single_step_instrumentation` shows which hosts have SSI enabled or disabled.
- `instrumentation_status` shows which hosts have encountered issues with service instrumentation.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Being that we are explaining the UI, we should use the labels used in the UI rather than the request parameter names.

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Technically users can write the facet queries into the query box, so this is helpful info. I agree though that maybe we should also include where to find the matching facets on the facet column in the UI.


If a service is not instrumented and no failure reason is listed, check whether the process exited quickly or used a language that isn't yet supported.

### Manually verify injection in the application container
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why would a customer need to do this manual verification if they have the troubleshooting data in the UI?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@sarjyusuf Can you weigh in on how to frame this content? Do we want it at all? This is what was pulled in from that Troubleshooting Confluence page. Perhaps this section if redundant or obsolete with the new UI-based capabilities?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
WORK IN PROGRESS No review needed, it's a wip ;)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants