Operations Engineer

Published
2016-08-18 16:02
Written by
Name of Company
Internet Archive
Type of Work
General IT
Telecommute ok?
No
Time commitment
Full Time
How to apply
Please send your resume and cover letter to Jobs+Managerofoperationsandnetworks@archive.org with the subject line 'CI-410: Operations Engineer'

Location: Inner Richmond, San Francisco, CA and City of Richmond, CA ON-SITE PRESENCE IN SF/RICHMOND IS REQUIRED! Remote employment not available for this position.

Job Classification: Full-time, exempt

Job Summary: The Internet Archive has over 25PB of unique digital information, all running across an integrated cluster of over 700 VMs on over 550 'bare-metal' hosts in 2 data centers. We are looking for a 'hands-on' operations manager and network engineer with proven experience effectively managing a high-performance team of system administrators and technical operations staff. The ideal candidate will be looking to take on a 'player-coach' role and have demonstrated experience improving and maintaining the reliability, performance, and security of both internal and publicly facing web infrastructure, online services, networks, and database systems. They must also be skilled in management communications and able to work collaboratively with our team of talented engineers and program staff.

Essential Job Functions:

    • Manage, contribute to, and mentor the technical team responsible for monitoring, maintaining, and restoring the health of all Internet Archive networks and online services. This includes all publicly-facing services, the storage and compute cluster, as well as key internal services related to crawling, indexing, and access to archived web content
    • Maintain and expand monitoring and reporting systems to communicate current and historical activity for multiple publicly facing Services and to ensure service continuity and performance.
    • Analyze, implement, and manage effective improvements in the maintenance and operations processes and infrastructure.
    • Assign, support, recruit, hire, schedule, and fire staff as needed to sustain operational objectives and efficiency.
    • Recommend the purchase of equipment needed to sustain responsive services and cost-effective operations.

Minimum Qualifications:

  • Experience managing large server cluster infrastructure
  • Experience as lead manager and mentor of a technical operations team
  • Passion and fierce advocate for the end user experience of web-delivered services
  • Experience in highly available 24x7 production environment.
  • Ability to 'fire fight' personally and to document and share critical knowledge with others
  • Passion for automation, data-driven decision making, and information reporting
  • Experience with high-bandwidth networking environments
  • Deep technical understanding of virtual hosts, containers, network architecture, DNS, DHCP
  • Work history that includes production-level programming in high-transaction environments.
  • Fluency in Linux system administration, Unix shell scripting, and familiarity with Python, PHP, etc.
  • Experience deploying and administering database, search, and web-host services
  • Excellent and creative problem solver. You do not need to know everything but you need to know how to find the solution.
  • Experienced in open source practices and passion for staying current with industry trends
  • Willingness to travel to network operation centers and participate, as necessary, in physical equipment install
  • BS Computer Science, or equivalent work experience

 

Preferred Qualifications:

    • Extensive experience with Ansible, Git, Nagios, Postgres, Redis, ELK stack, etc.
    • Experience deploying and maintaining big-data analytics tools, especially Hadoop, Druid, or RethinkDB
    • Excellent oral/written communication and documentation skills
    • MS in Computer Science or equivalent work experience
    • Flexibility and a sense of humor

Reporting Structure: The Manager of Operations Engineering reports to the Director of Engineering and works closely with the Head Librarian and Founder.

To Apply: Please send your resume and cover letter to Jobs+Managerofoperationsandnetworks@archive.org with the subject line 'CI-410: Manager of Operations and Networks.'

Internet Archive reserves the right to revise job descriptions or work hours as required.

Internet Archive is an Equal Opportunity Employer and a 501(c)(3) non profit library founded in 1996.

The Archive will consider for employment-qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Chance Ordinance.