Best practices for monitoring health

To ensure the health of the components in your data center environment, follow these best practices.

Overall health monitoring


[NOTE: ]

NOTE: You can view health and alerts on some monitored servers. To see what servers can be monitored, see monitored server hardware in the HPE OneView Support Matrix.


Monitoring step

Related information: Activity

Related information: Server Hardware

View the originating event(s) that caused a specific alert.

  1. Select an alert.

    GET /rest/alerts/

  2. Get a specific alert using the alert ID.

    GET /rest/alerts/{id}

  3. Get the associated event(s).

    GET /rest/events/{id}

  • Fix the problem. Use the recommended fix (perform a GET operation on the specific alert resource and view the correctiveAction attribute), or research the alert.

Server hardware health monitoring

A server or servers turn to a warning or critical status when something is not correct within the appliance. If a server profile has been applied to a failed server, the server profile will also be in a failed status.

Monitoring step

Related information: Activity

  • Make sure that server profiles are appropriately assigned to the server hardware.

Related information: Create a server profile

Network health monitoring

To determine the current health of a network or networks on the appliance, view alerts for interconnects and logical interconnects to verify the correct connections. To list alerts, you can perform a GET operation on alerts and filter for alerts related to interconnects. To list states, you can perform a GET operation on interconnects and logical interconnects and filter for an OK state.

Monitoring step

View alerts for interconnects.

  1. Select an interconnect alert.

    GET /rest/alerts?filter="physicalResourceType='{interconnect}'"&filter="severity='{WARNING, CRITICAL}'"

  2. Get a specific alert using the alert ID.

    GET /rest/alerts/{id}

Related information: Interconnects

Filter for logical interconnects with unhealthy stacking.

  1. Get unhealthy logical interconnect.

    GET 
    /rest/logical-interconnects?filter="stackingHealth='{Unknown, Disconnected}'"
    
  2. View specific unhealthy interconnect using the interconnect ID.

    GET 
    /rest/logical-interconnects/
    
    {id}
    

Related information: Logical Interconnects

  • Use information provided in the alert to fix the problem. Use the recommended fix if there is one, or research the alert.

Related information: Activity