- Day- to-day tasks consisted of monitoring, measuring, and maintaining the availability and health of Splunk services and platforms. Provided ongoing support for Splunk platforms and AWS Cloud services as required e.g., problem and incident management and taking part in troubleshooting for service recovery.
- Successfully designed a Firewall troubleshooting dashboard, tracking traffic flows through firewalls to identify IP or Port blockages between sites in different regions, which allowed for engineers to act promptly to remediate any issues.
- Engineered an In-House SNOW Plugin Integration for AWS Splunk, Splunk On-Prem, and Splunk Enterprise Security
- Performed integration activities to connect with 3rd party software APIs, enhancing overall system capabilities.
- Crafted a Python script for the successful reassignment of over 20,000 knowledge objects and role mappings via AWX.
- Developed a master script for building all Cortex components in a single build as part of AMI updates.
- Implemented Telegraf observability throughout the Splunk platform via AWX, mitigating potentialresource outages and supporting expansion needs. Developed a Telegraf upgrade script for seamless updates.
- Performed DNS entry changes on Deployer, Cluster Master, Indexers, and other Splunk instances within UAT and Production environments.
- Responsible for configuring AWS resources, including S3 buckets, Load Balancers, Security Groups, and IAM Roles and policies.
- Led the scaling of Splunk Indexer cluster and Search Head Cluster, conducting server resizing to meet operational demands.
- Conducted a thorough review and set up of new Props.conf and Transforms.conf configurations for all data sources within the Splunk platform, enhancing data enrichment and processing efficiency.
- Managed SSL certificates for secure communications, ensuring the confidentiality of data.
- Develop, create, and manage custom Splunk Knowledge objects, including alerts, macros, eventtypes,
field aliases, and dashboards, etc.
- Designed an E-mail flow troubleshooting dashboard, monitoring e-mail flows, and addressing blocks from external senders to internal recipients.
- Acted as a single point of contact for Splunk technical questions, software issues, and for management escalations, granting approvals and denials for infrastructure and platform change requests.
- Performed data integration via HTTP Event Collector (HEC) in order to efficiently send data to Splunk Enterprise and Splunk Cloud.
- Configured a multi-site cluster for disaster recovery planning; Set up DR validation scripts that carried out DR tests on new infrastructure.