The Institute of Data Intensive Engineering and Science (IDIES), in The Henry A. Rowland Department of Physics and Astronomy Department, seeks candidates for the position of IDIES Dev Ops Engineer (Systems Engineer), with background in highly specialized research computing environments. Overview: This position develops and maintains reliable, reusable automation processes for all IDIES data center resources, including deployment, maintenance, and integration with monitoring services, providing IT support for IDIES with particular emphasis on highly specialized research computing environments. This position is responsible for automating the installation, configuration, security, and monitoring of IDIES computing resources in alignment with established local standards. In the case of new technology, this position will be instrumental in the development of new standards according to best practices and stakeholder needs. A particular challenge will be to identify common ground among the various supported projects to maximize efficiency and minimize the impact of one-offs. Solutions should be well-documented and maintainable by members of the team. This position will include ad-hoc on-call responsibilities to monitor team Slack chat and email during off hours and respond to issues based upon level of severity and escalate where necessary.
In support of IDIES, this position interacts routinely with IDIES systems administrators, engineers, developers, and may, also, interface directly with clients on an ad hoc basis to develop solutions to support cutting-edge research. IDIES and the IDIES data center currently supports:
- Over 300 users across 30+ projects.
- 300 systems and 50 virtual machines in 2 datacenters.
- High-speed (10/40/100Gb) network infrastructure.
- 4000 hard drives with over 15 petabytes of storage capacity.
- While common infrastructure elements are shared across projects, there is also considerable heterogeneity due to one-off project-specific requirements.
Specific Duties and Responsibilities:
- Design/Develop/ Automate Server Deployment and Maintenance Tasks (Linux and Windows)
- Design/Develop/ Automate Data Center Network and Power Infrastructure (Switches and PDUs)
- Design/Develop/ Automate Facilities, Network, Server, and Storage Monitoring Infrastructure
- Design/Develop/ Automate ad hoc Project Based Solutions
Listed skills specific to this position:
- Excellent software development skills
- Advanced scripting skills in bash, Perl, Python, Powershell, and Ruby
- Strong knowledge of configuration management tools such as Ansible and Chef
- Strong knowledge of version control systems such as Subversion and Git
- Strong background in planning and implementing enterprise class, highly robust technology solutions
- Strong understanding of security “best practices”
- Must have in-depth knowledge of the technologies used in the school, including Windows, MacOS, and Red Hat Linux operating systems, application software, hardware, and client-server interactions
- Strong knowledge of real-time monitoring systems, time-series databases, and web front-ends
(e.g. Nagios, Bosun, Telegraf/InfluxDB/Grafana, Prometheus) - Strong understanding of containerization and virtualization (e.g. Docker, Podman, Kubernetes, KVM)
- Strong knowledge of Linux systems administration, including familiarity with automated cluster deployments, configurations, and management
- Strong knowledge of industry-standard database systems with the ability to utilize them in specialized configurations to meet the unique extreme-computing needs of data intensive science applications
- Strong understanding of high-speed Ethernet network deployment and switch management
- Strong understanding of GPU hardware and computing environments (e.g. NVIDIA, CUDA)
- Strong understanding of storage and filesystem management, including ext3, XFS, ZFS, and Ceph
- Strong knowledge of services such as DHCP, Bind, NTP, Apache, LDAP, Kerberos, and Active Directory
- Strong knowledge of workload management/batch queueing systems such as Slurm
- Ability to manage multiple simultaneous projects or tasks and prioritize effectively
- Strong interest in science and technology with the ability to adapt quickly to leading-edge technologies
- Genuine interest in “tinkering”
- Strong critical and analytical skills
- Strong customer service skills
- Ability to complete tasks in a timely fashion
- Ability to work independently under established deadlines and collaboratively as a team member
- Ability to write clear documentation
Minimum Qualifications (Mandatory):
- Bachelor’s degree. Additional experience can be substituted for education.
- Five years related experience. Additional education can be substituted for experience.
Preferred Job Qualifications:
- Knowledge in the assigned IT environments.
JHU Equivalency Formula: 30 undergraduate degree credits or 18 graduate degree credits = 1 year of experience. For jobs where equivalency is permitted, up to two years of non-related college coursework may be applied towards the total minimum education/experience required for the respective job.
Classified Title: IDIES Dev Ops Engineer
Working Title: IDIES Dev Ops Engineer
Role/Level/Range: ATP/04/PE
Starting Salary Range: $69,831 – $95,953 annually (commensurate with experience)
Employee group: Full Time
Schedule: Monday-Friday, 8:30am – 5:00 pm
Exempt Status: Exempt
Location: Homewood Campus
Department name: Physics and Astronomy
Personnel area: School of Arts & Sciences
The successful candidate(s) for this position will be subject to a pre-employment background check.
If you are interested in applying for employment with The Johns Hopkins University and require special assistance or accommodation during any part of the pre-employment process, please contact the HR Business Services Office at jhurecruitment@jhu.edu. For TTY users, call via Maryland Relay or dial 711.
Johns Hopkins has mandated COVID-19 and influenza vaccines, as applicable. Exceptions to the COVID and flu vaccine requirements may be provided to individuals for religious beliefs or medical reasons or because the individual is pregnant or attempting to become pregnant. Requests for an exception must be submitted to the JHU vaccination registry. For additional information, please visit: https://covidinfo.jhu.edu/health-safety/covid-vaccination-information/
The following additional provisions may apply depending on which campus you will work. Your recruiter will advise accordingly.
The pre-employment physical for positions in clinical areas, laboratories, working with research subjects, or involving community contact requires documentation of immune status against Rubella (German measles), Rubeola (Measles), Mumps, Varicella (chickenpox), Hepatitis B and documentation of having received the Tdap (Tetanus, diphtheria, pertussis) vaccination. This may include documentation of having two (2) MMR vaccines; two (2) Varicella vaccines; or antibody status to these diseases from laboratory testing. Blood tests for immunities to these diseases are ordinarily included in the pre-employment physical exam except for those employees who provide results of blood tests or immunization documentation from their own health care providers. Any vaccinations required for these diseases will be given at no cost in our Occupational Health office.
Equal Opportunity Employer
Note: Job Postings are updated daily and remain online until filled.
EEO is the Law
Learn more:
https://www.eeoc.gov/sites/default/files/migrated_files/employers/poster_screen_reader_optimized.pdf