Data/ML Engineer
It’s your job to ensure the right data powers the right applications at the right time and in the right place.
With an increased number and variety of workloads, how can you address all aspects of data logistics and processing that can make or break the success of any data-intensive project, including analytics and AI/machine learning? And do it easily and reliably?
On this page, we provide content to help you meet these challenges. You will find a rotating selection of foundational material, ideas to help you get inspired, as well as practical tips on key issues to improve efficiency and performance. You’ll also learn what Hewlett Packard Enterprise (HPE) offers.
The roles of the Data/ML Engineer and Data Scientist can overlap. You may also find content of interest to you on the Data Scientist page. Content on this page changes as new material becomes available or new topics arise, so check back regularly.
Get Inspired
A sampler of new ideas related to data/ML engineering:
Learn how industry innovation may affect your job.
Building a Foundation
Key to data science projects is a unifying data infrastructure to handle logistics and the containerization of applications
Simplify operations and workflows with the right data fabric and orchestrate containerized applications with open source Kubernetes.
Unit testing isn’t just for code: you need to unit test your data. Watch Deequ: Unit Tests for Data
Data locality helps support GPUs and other accelerators from a data point of view. Read How fine-grained data placement helps optimize application performance
Better connections between data producers and data consumers make data science more successful. Read Getting value from your data shouldn’t be this hard
Study the technical paper HPE Ezmeral Data Fabric: Modern infrastructure for data storage and management
Learn how management of large scale Kubernetes clusters is made easier with HPE Ezmeral Runtime Enterprise
Addressing Key Concerns
What can I do to lower the entry barriers to developing new AI/ML/data science projects?
AI/ML projects can and should be run on the same system as analytics projects: Read “Chapter 3: AI and Analytics Together” in the free eBook AI and Analytics at Scale: Lessons from Real-World Production Systems
Who should be included on the team to ensure the success of the project?
How do I handle data movement?
Read A better approach to major data motion: built-in data mirroring
Watch the webinar Data Motion at Scale: the Untold Story
What makes it easier to deal with edge computing in large-scale systems?
How do I ensure data trust and security?
New approaches are improving the connection between data producers and data consumers. See how in the video Dataspaces: connecting to data you can trust
Learn about the SPIFFE and SPIRE projects that are hosted by the CNCF Foundation
How are others doing this?
Check out these real-world case studies
Skill Up
Munch & Learn technology talkMonthly meetups where you can hear from experts on the newest technologies. Catch up on any you may have missed and register for upcoming talks.
Workshops-on-Demand
Free, in-depth, hands-on workshops that allow you to explore details of a technology by interacting with it. Designed to fit your schedule, these workshops are available 24/7 – from anywhere at any time.