Back To Schedule
Wednesday, May 13 • 9:30am - 9:45am
Overview of Hardware Fault Management Subproject

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

System failures due to hardware faults in large scale data center are difficult to diagnose; resulting in inefficient repair action to maintain the quality service of the fleet. Presented here is an overview of the proposed “Hardware Fault Management” subproject within the “System and Hardware Management” working group. The goal of this subproject is to 1) Develop a HW Fault management Solution Guide. 2) Develop Solution Validation Recipes; to assist OCP system and hardware management in large scale data center.


Zhengyu Yang

Hardware System Engineer, Facebook
avatar for Anil Agrawal

Anil Agrawal

Technial Lead (RAS), Intel Corp
Anil has been working as electrical engineer at Intel since 2000. He is passionate about solving customer issues related to system architecture, design, and validation. His focus has been to develop system fault handling solutions. Please refer to Anil\'s linked-in profile for more... Read More →

Wednesday May 13, 2020 9:30am - 9:45am
EW: Systems & HW Management