Cloud Systems Developer - AI Operations

For over a decade now, OpenNebula Systems has been leading the development of the European open source technology that helps organizations around the world to manage their corporate data centers and build their Enterprise Clouds.

If you want to join an established leader in the cloud infrastructure industry and the global open source community, keep reading, because you can now join a team of exceptionally passionate and talented colleagues whose mission is to help the world's leading enterprises to implement their next-generation edge and cloud strategies. We are hiring!

Since 2019, and thanks to the support from the European Commission, OpenNebula Systems has been leading the edge computing innovation in Europe, investing heavily in research and open source development, and playing a key role in strategic EU initiatives such as the IPCEI-CIS and the “European Alliance for Industrial Data, Edge and Cloud”.

We are currently looking for a Cloud Systems Developer to come and join us in Europe as part of our new team developing the AI-enabled operations component of the next generation management platform for the Cloud-Edge Computing Continuum.

Job Description

Cloud Systems Developers - AI Operations - are responsible for coding the AI-enabled engine to optimize operations on cloud and edge infrastructures. The engine will provide smart monitoring, intelligent workload forecasting, workload and infrastructure orchestration capabilities, and log and metric anomaly detection. The new Systems Developers will also help in activities related to enabling new programming frameworks for data processing on the cloud-to-edge continuum.

Systems Developers focus on ensuring that cloud management products align with project and business goals, comply with best practices, and meet performance requirements. You’ll work in an agile environment to design, develop, test, maintain and validate with use cases a next generation management platform for the Cloud-Edge Computing Continuum. You will also participate in the upstream community, on challenging projects developing innovative edge/cloud systems. Applicants should be passionate about the future of the software defined datacenter, distributed systems, and open source.

What you will do

  • Analyzing functional and systems requirements, designing, developing and maintaining cloud systems architectures, and solutions to meet the demands of AI-enabled and data operations on multi-cloud environments
  • Designing component APIs, interfaces and AI/ML models for smart monitoring, observability, orchestration, and anomaly detection
  • Evaluating the performance of AI-Enabled cloud systems to identify bottlenecks and bugs
  • Test-driven development of large solutions integrating and extending open source products, and using Git based workflows to develop new features in the project repositories
  • Writing and maintaining software documentation
  • Working with user use cases to test, debug, and troubleshoot software, assuring quality and functionality
  • Collaborating with other companies in the cloud-edge ecosystem within international projects and open-source communities. Availability to occasional travel and participation in international events and meetings
  • Collaborating with the integration and deployment teams in support and issue troubleshooting and triage, the use cases and solutions team in discovery and demo sessions, and the community team in contributions to the open-source community

What you will bring

  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field
  • 3+ years of hands-on experience in AI/ML and cloud systems development and integration using open-source technologies
  • Demonstrated expertise in researching, developing, and implementing AI/ML algorithms for predictive analytics, workload optimization, log analysis, and anomaly detection to enhance system performance trends.
  • Proficiency in designing, developing and maintaining Python code used both for AI/ML, data processing and in general software engineering tasks. Extensive knowledge of Python ML frameworks (Pytorch, TensorFlow, Keras), ML development lifecycle tools (Kubeflow, MLFlow) and experience in building RESTful APIs for serving AI/ML products.
  • Experience deploying and using monitoring and alerting tools like Prometheus. Design monitoring metrics and data structure. Elaborate and analyze usage reports obtained from the monitor data.
  • Background in deploying and operating Kubernetes systems.
  • Experience with Cloud Management technologies and their associated technologies. Understand the implication of managing virtualized infrastructures and the orchestration of the underlying subsystems.

What's in it for me?

Some of our benefits and perks vary depending on location and employment type, but we are proud to provide employees with the following;

  • Base Salary Range 38-50k
  • Competitive compensation package and flexible remuneration: Meals, Transport, Nursery/Childcare
  • Customized workstation (macOS, Windows, Linux)
  • Private health insurance
  • Paid time off: Holidays, Personal Time, Sick Time, Parental leave
  • Afternoon-off every friday and during summer
  • Remote company with bright HQ centrally located in Madrid; offices in Boston (USA), Brussels (Belgium) and Brno (Czech Republic); and access to office space near your location when needed. During the first year, for onboarding purposes, and for participation on certain projects, employees should be able to attend events and face-to-face meetings in our Madrid offices and other European cities. All employees are also required to attend our company-wide face-to-face all-hands meetings twice a year
  • Healthy work-life balance: We encourage the right for Digital Disconnecting and promote harmony between employees personal and professional lives
  • Flexible hiring options: Full Time/Part Time, Employee (Spain/USA) / Contractor (other locations)
  • We are building an awesome, Engineering First Culture and your opinion matters: Thrive in the high-energy environment of a young company where openness, collaboration, risk-taking, and continuous growth are valued
  • Be exposed to a broad technology ecosystem. We encourage learning and researching new technologies and methods as part of your everyday duties

Desired Skills

How to Apply

https://careers.opennebula.io/jobs/3443396-cloud-systems-developer-ai-operations

Contact Info

Posted: May 22, 2024

Apply


Get Updates