Setting up Azure Event Hubs in Data Manager
The Azure Event Hubs connector for Splunk Data Manager simplifies data onboarding by enabling seamless ingestion of event streams into Splunk Cloud Platform.
Azure Event Hubs is designed to handle large-scale data streams, processing millions of events per second. By integrating with Splunk Data Manager, organizations can efficiently capture continuous data flows, like logs or sensor information, and enhance their analytics capabilities.
The Data Manager Azure Event Hubs connector introduces several powerful features to streamline data ingestion. One major improvement is automatic partition detection, which eliminates the need for manual configuration. When you create an input, Data Manager automatically identifies the available partitions and assigns collectors accordingly. This ensures efficient data collection and reduces setup time.
Key features of the connector include:
- Partition load balancing: The connector uses the Azure Event Hubs Go SDK to automatically distribute data partitions across multiple collectors. This load balancing mechanism improves system resiliency; if a collector fails, others automatically take over its partitions. This ensures uninterrupted data flow and a more stable system overall.
- Checkpointing: Checkpointing tracks the exact position of each consumer within a partition. If a disconnection occurs, the connector resumes data collection from the last checkpoint, preventing gaps or duplicate data ingestion.
- Authentication: The connector supports a secure authentication method that enhances security while simplifying credential rotation. This helps organizations maintain strong access controls without adding complexity to their workflow.
- Event building and error handling: The connector maps event attributes - such as the body and properties - directly to log records. This ensures compatibility and maintains correct formatting, reducing the risk of data inconsistencies.
- Exporter: The connector uses the Splunk HEC exporter with a
/raw
endpoint to properly format and ingest complex event data into Splunk Cloud Platform. This ensures that all collected data is structured for efficient analysis.
This article provides guidance for configuring the Azure Event Hubs Data Input in Data Manager, as well as migrating from the Splunk Add-on for Microsoft Cloud Services to Data Manager.
Configuring the Azure Event Hubs Data Input in Data Manager
- Open Data Manager.
- Select Microsoft Azure > Azure Event Hub > Create a new data input, and then click the Next button.
- If you don't yet have Azure Event Hubs enabled, click First time request guide and fill out the form that follows to request provision. This is usually provided within 24 hours. Otherwise, select Azure Event Hub and click Next.
- Ensure Azure prerequisites are met, and then click Next.
- Enter Azure Event Hubs data information. Note that Data Manager automatically validates the number of partitions. You will be able to change this in edit mode after data onboarding is complete. Click Review Data Input when you're ready.
- Review the data inputs, then click Finish Setup and Monitor Configuration.
- Monitor your inputs through Data Manager.
Migrating from the Splunk Add-on for Microsoft Cloud Services to Data Manager
If you are migrating from the Splunk Add-on for Microsoft Cloud Services, follow these steps. Basic instructions for this process, without screenshots or additional guidance, can be found on Splunk Docs.
Prerequisites
- You must be using at least version 5.4 of the Splunk Add-on for Microsoft Cloud Services.
- Migration is supported only on Splunk Cloud Platform instances.
- All prerequisites can be found in Data Manager under New Input → Microsoft Azure → Azure Event Hub → Migrate from an existing TA.
Step-by-step migration process
- Prepare for migration:
- Verify that your Event Hub inputs are set up and actively pulling data into the Splunk platform via the Splunk Add-on for Microsoft Cloud Services.
- Ensure that all inputs connected to the same Event Hub reside on the same Splunk instance.
- Check input health:
- Navigate to the Splunk Add-on for Microsoft Cloud Services.
- Select the Configuration tab.
- On the Export tab, set Input Status to an inactive state to prepare for export. This pauses ingestion at the current checkpoint. Data ingestion will resume from this checkpoint after migration.
- Ensure that the Ready for export option is checked.
- Click Export.
- Export configuration snapshot:
- Ensure the Health Status is marked as Ready.
- Click Export to generate a JSON snapshot.
- The exported JSON includes:
- Server information
- Timestamp
- Modular input configurations
- Checkpoints (to prevent duplicate data ingestion)
- Import JSON into Data Manager:
- Open Data Manager in Splunk Cloud Platform.
- Click the New Data Input button.
- Select Microsoft Azure → Azure Event Hub.
- Complete any remaining prerequisites.
- Upload the exported JSON file.
- Click Next.
- Enter client secret:
- The client secret is not included in the JSON for security reasons, so you will need to enter it manually.
- Review the list of Event Hubs and associated inputs.
- Deselect any inputs you do not wish to migrate.
- Click Complete Migration of X Data Input to complete the migration process.
- Verify post-migration data ingestion:
- Check that data is correctly ingested in Splunk Cloud Platform.
- Ensure that checkpoints were maintained to prevent duplicate data ingestion.
- Cleanup:
- Wait 24 hours to confirm that all inputs are working correctly.
- Delete modular inputs from the Microsoft Cloud Services Add-on after verification.
Troubleshooting tips
- Desynchronization issues: Ensure all modular inputs for the same Event Hub are on a single Splunk Cloud Platform instance.
- Long processing times: Large instances with numerous modular inputs might experience delays during export/import.
- Error messages: Errors related to index configurations or source types will be flagged during the migration process.
Next steps
These resources might help you understand and implement this guidance:
- Microsoft: Quickstart: Create an event hub using Azure portal
- Splunkbase: Splunk Add-on for Microsoft Cloud Services