Night Operations Specialist (Mid to Senior)

Bungie Studios is seeking an experienced Network Operations Specialist to assist the BNOC (Bungie Network Operations Center) in maintaining our datacenter and mission-critical operations after hours and on weekends. The candidate must be a dedicated problem solver who can work independently, follow directions, multitask, and prioritize in a fast-paced and demanding environment.

Responsibilities

  • Perform alert-based investigation and troubleshooting
  • Engage and coordinate Engineering and Networking teams, providing the critical information needed for break/fix solutions to in-game and service-impacting issues
  • Gather game and service metrics to add to investigations and assist in root cause analysis
  • Create and maintain clear documentation and troubleshooting runbooks for common alert types
  • Monitor shared inbox and prioritize escalations from multiple teams
  • Implement and document production system upgrades and patches
  • Manage the tracking and deployment of OS updates in a datacenter environment
  • Provide service reliability and availability by minimizing downtime
  • Perform walkthroughs and troubleshooting of hardware in server labs and IDF closets
  • Provide real-time support for production server farms
  • Verify function and availability of a live game environment
  • Manage escalations between internal teams and external partners
  • Contribute to ongoing development efforts through playtesting, special projects and/or providing technical solutions to problems
  • Manage projects and coordinate with Team Leads and other departments in the studio
  • Design and implement scripts and applications to improve productivity, workflow, or tools
  • Perform BIOS and firmware upgrades on servers and networking equipment
  • Maintain and deploy images of operating systems via MDT and WDS
  • Provide technical support to peers during problem determination and resolution
  • Manage software and service patch validation processes
  • Work with peers to improve the infrastructure for service management, deployment, and patching
  • Create, modify, and maintain multiple subnets in a production environment
  • Track incidents through their life cycle from investigation to root cause analysis

Required Skills

  • A minimum of 5 years IT/Internet hands-on work experience
  • A minimum of 3 years hands-on/NOC or field DC management experience
  • Experience working in a NOC on nights and weekends on a long-term basis
  • Experience with Data Center industry standards including system automation & monitoring
  • Hands-on and field experience with medium to large-scale hardware deployment, installation and troubleshooting, especially on Dell, HP and Cisco blade systems
  • Thorough understanding of Windows Server and Linux-based operating systems, including system installation and configuration, file system concepts, resource monitoring, and user administration
  • Thorough understanding of Windows and Linux network services; specifically, the ability to install, configure, and troubleshoot TCP/IP-based services such as DHCP and LDAP
  • Able to write scripts in an administrative language (PowerShell, Shell)
  • Experience with out-of-band (OOB) management, including iLO/iDRAC/CMC
  • Knowledge of TCP/IP networking
  • Good interpersonal and communication skills
  • Demonstrated runbook documentation skills
  • Fluency in English
  • Degree in Computer Science or equivalent related NOC field experience

Nice To Have Skills

  • Strong desire to learn new technologies
  • Proficiency in one of the following languages: Python, C#, Go
  • Experience working with Cisco UCS
  • Experience working with storage technologies (NAS & SAN)
  • SQL and NoSQL Database experience
  • Experience working with containerization technologies (Docker, Kubernetes, etc.)
  • Experience with industry-standard configuration management and deployment systems (Chef, Puppet, Ansible, Octopus Deploy, etc.)
  • Experience working with Elasticsearch, Kibana, Grafana, Graphite or other TSDBs
  • Experience working with and implementing monitoring systems and solutions
  • Experience working with Redis
  • Experience working with cloud infrastructure (Amazon, Azure)
  • CCNA or better
