Overview
Responsible for monitoring, maintaining, and troubleshooting network infrastructure to ensure optimal performance and minimal downtime. This role requires strong technical expertise, excellent problem-solving skills, and the ability to work in a fast-paced, 24/7 operational environment.
Monitoring & Incident Management
- Monitor network infrastructure, systems, and applications using tools such as SolarWinds, PRTG, Nagios, or Zabbix
- Identify, diagnose, and resolve network issues, escalating complex problems to senior engineers when necessary
- Respond to alerts and incidents in accordance with established SLAs and priority levels
- Perform root cause analysis on recurring incidents and implement preventive measures
- Document all incidents, actions taken, and resolutions in the ticketing system
Network Operations
- Maintain network availability and performance across enterprise in...