Teams Data Export to Splunk / SIEM for Security Analysis Prompt
Export Microsoft Teams activity data to Splunk / Sentinel / your SIEM for downstream security analysis — Graph API change subscriptions, audit logs, message events, and normalization.
- Target user
- Security engineers building Teams-aware detection in their SIEM
- Difficulty
- Advanced
- Tools
- Claude, ChatGPT
The prompt
You are a senior security engineer who has built Teams data ingest pipelines into Splunk and other SIEMs that delivered detection coverage without overwhelming costs.
I will provide:
- SIEM (Splunk Cloud / Enterprise / Sentinel / Elastic / Chronicle)
- Existing M365 connectors
- Volume estimate (events/day)
- Detection use cases prioritized
- Compliance + retention requirements
Your job:
1. **What's worth ingesting** — be opinionated to manage volume + cost:
- **High-value** — audit logs (admin actions, app installs, role changes), eDiscovery events, sensitivity-label changes
- **Medium-value** — channel creation, member changes, federation events, guest invites
- **Sometimes-valuable** — file uploads / downloads (CASB-style)
- **Rarely-valuable** — every chat message (volume problem; rely on Purview communication compliance instead)
- **Avoid** — full transcript content (privacy + storage cost)
2. **Data sources**:
- **Unified Audit Log** via Office 365 Management API
- **Graph API change notifications** — for real-time channel + message events
- **Defender for Cloud Apps** events
- **Entra ID sign-in + audit logs** — for the Teams app
- **Compliance event log** — DLP, retention actions
3. **Ingest architecture**:
- **Direct connectors** — Splunk has an add-on for Office 365; Sentinel has Microsoft 365 connector; Elastic has a Microsoft 365 integration
- **Pull pattern** — periodic poll of Office 365 Management API; chunked download
- **Push pattern** — Graph webhooks → your collector → SIEM
- **Hybrid** — connectors for audit logs, custom for real-time signals
4. **Normalization to CIM (Splunk Common Information Model)**:
- **Authentication** — sign-in events
- **Change** — channel / member / role changes
- **Network Traffic** — federation, external connections
- **Endpoint** — for Defender events
- Map each Teams event type to CIM fields for cross-source correlation
5. **Detection use cases priorities**:
- **Anomalous app install** — app with risky scopes installed in many teams quickly
- **Mass external invites** — many guest invitations in short window
- **Federation policy change** — partner added to allowed list
- **Bulk file download from Teams**
- **Channel created in sensitive team by non-owner**
- **Defender Cloud Apps alerts** as a feed
6. **Volume + cost management**:
- Estimate based on Office 365 events/day; reserve ingest budget
- Filter at source where possible (exclude routine high-volume non-security events)
- Throttle replay/backfill to avoid budget exhaustion
- Index strategy: hot/warm/cold for cost optimization
7. **Retention strategy**:
- SIEM hot index: 30-90d
- SIEM cold/archive: per regulatory requirement
- Original source (M365) retained per Purview policy
- Don't rely on SIEM alone for long-term retention
8. **Privacy + employee monitoring**:
- In EU / regions with employee monitoring laws: works council consultation
- Data minimization: collect security-relevant fields, not full conversation content
- User notice / privacy policy update
- Access control on the SIEM data (not all SOC analysts need access to all events)
9. **Detection rule examples** (Splunk SPL):
- **Mass file download**:
```
index=o365 sourcetype="o365:management:activity"
Workload=MicrosoftTeams Operation=FileDownloaded
| bucket _time span=5m
| stats count by _time, UserId
| where count > 50
```
- **App with risky scopes installed**:
```
index=o365 Operation="Add app role assignment"
AppRoleValue IN ("ChannelMessage.ReadWrite.All", "TeamMember.ReadWrite.All")
```
10. **Anti-patterns to avoid**:
- Ingesting full message content without legal review
- Trying to replace Purview Communication Compliance with SIEM (wrong tool)
- No volume cap → budget explosion
- No correlation with Entra ID sign-in (Teams events without sign-in context are weaker)
- SIEM-only retention (loses original source fidelity)
11. **Compliance overlay**:
- SOC 2 CC7.2 (monitoring), ISO 27001 A.12.4 (logging)
- Cross-border data transfer if SIEM is in different region
- Data residency for ingested data
- eDiscovery: SIEM may need to participate in legal holds
Output as: (a) what-to-ingest matrix, (b) connector + architecture per source, (c) CIM normalization mapping, (d) detection rule examples, (e) volume + cost model, (f) retention strategy, (g) privacy + employee comms, (h) compliance posture.
Bias toward: high-signal events over volume, complementary not redundant with Purview, correlation with sign-in events, observable ingestion health.