Server Hardware Monitoring with IPMI

Server hardware monitoring is crucial for maintaining system reliability and preventing costly downtime. With Server Scout's IPMI integration, you can monitor critical hardware metrics like temperatures, fan speeds, and power consumption directly from your server's Baseboard Management Controller (BMC).

What is IPMI?

Intelligent Platform Management Interface (IPMI) is an industry standard that allows you to monitor and manage server hardware independently of the operating system. Different manufacturers use various implementations:

  • Dell: DRAC (Dell Remote Access Controller)
  • HP/HPE: iLO (Integrated Lights-Out)
  • IBM: IMM (Integrated Management Module)
  • Supermicro: IPMI

Server Scout uses the standardised IPMI protocol to communicate with all these systems through the ipmitool utility.

Adding an IPMI Device

To begin monitoring server hardware, you'll need to add a device with the drac type in Server Scout.

  1. Navigate to your Server Scout dashboard
  2. Click "Add Device"
  3. Select device type: drac
  4. Configure the following settings:

- Device Name: A descriptive name for your server - BMC IP Address: The IP address of your server's management interface - IPMI Username: BMC username (often root or admin) - IPMI Password: Corresponding password - Monitoring Interval: How frequently to collect metrics (recommended: 5-10 minutes)

# Test IPMI connectivity manually
ipmitool -I lanplus -H <BMC_IP> -U <username> -P <password> sensor list

Hardware Metrics Collected

Server Scout's devices plugin uses ipmitool to gather comprehensive hardware telemetry:

Temperature Monitoring

  • CPU temperatures: Individual processor core temperatures
  • Inlet temperature: Ambient air temperature entering the chassis
  • Exhaust temperature: Hot air temperature leaving the system
  • Memory temperatures: DIMM temperature sensors where available

Fan Speed Monitoring

  • System fans: Chassis cooling fans (measured in RPM)
  • CPU fans: Processor cooling fans
  • Power supply fans: PSU cooling fans

Power Monitoring

  • Total power consumption: Current power draw in watts
  • Power supply status: Health status of each PSU
  • Voltage rails: 12V, 5V, and 3.3V rail monitoring

System Health

  • Chassis intrusion detection: Physical tampering alerts
  • System Event Log (SEL) entries: Hardware error events and warnings

Viewing Hardware Health

Once configured, hardware metrics appear on the device detail page:

  1. Click on your IPMI-enabled device from the dashboard
  2. Navigate to the Hardware tab
  3. Review current readings and historical graphs

The interface displays:

  • Real-time sensor readings with colour-coded status indicators
  • Historical trend graphs for temperature and power consumption
  • Recent SEL entries with timestamps and severity levels
  • Fan speed charts showing cooling system performance

Setting Up Hardware Alerts

Server Scout can automatically alert you when hardware issues arise:

Temperature Alerts

Configure thresholds for critical components:

  • CPU temperature: Alert above 75°C
  • Inlet temperature: Warning above 30°C
  • Exhaust temperature: Alert above 65°C

Fan Speed Alerts

Monitor cooling system health:

  • Fan failure: Alert when RPM drops to zero
  • Reduced airflow: Warning when fans operate below 50% speed

Power Supply Alerts

Ensure power redundancy:

  • PSU failure: Immediate alert on power supply faults
  • Power consumption: Warning when approaching capacity limits

SEL Monitoring

Critical for identifying hardware problems:

  • New error entries: Alert on correctable and uncorrectable errors
  • Predictive failure: Early warning for components showing signs of failure

To configure alerts:

  1. Go to Settings > Alerts
  2. Select your IPMI device
  3. Enable desired alert types
  4. Set appropriate thresholds
  5. Configure notification methods (email, webhook, etc.)

Best Practices

  • Regular SEL review: Check system event logs weekly for patterns
  • Baseline establishment: Monitor normal operating ranges for your environment
  • Redundancy verification: Ensure alerts trigger when redundant components fail
  • Environmental correlation: Compare inlet temperatures with room conditions

IPMI monitoring provides invaluable insight into your server's physical health, helping prevent hardware failures before they impact service availability. With Server Scout's comprehensive hardware monitoring, you can maintain optimal system performance and extend equipment lifespan.

Frequently Asked Questions

How do I set up IPMI monitoring in ServerScout

Add a device with the 'drac' type in ServerScout. Configure the device name, BMC IP address, IPMI username (often 'root' or 'admin'), IPMI password, and monitoring interval. ServerScout uses the standardized IPMI protocol to communicate with Dell DRAC, HP iLO, IBM IMM, and Supermicro IPMI systems.

What hardware metrics does IPMI monitoring collect

ServerScout collects CPU and inlet temperatures, fan speeds for system and CPU fans, total power consumption, power supply status, voltage rail monitoring (12V, 5V, 3.3V), chassis intrusion detection, and System Event Log entries for hardware errors and warnings.

How does IPMI work for server monitoring

IPMI (Intelligent Platform Management Interface) is an industry standard that monitors and manages server hardware independently of the operating system. ServerScout uses the ipmitool utility to communicate with the server's Baseboard Management Controller (BMC) and gather hardware telemetry data.

IPMI monitoring not working troubleshooting

Test IPMI connectivity manually using: 'ipmitool -I lanplus -H <BMC_IP> -U <username> -P <password> sensor list'. Verify the BMC IP address is correct, IPMI username and password are accurate, and the management interface is accessible from your ServerScout installation.

What IPMI hardware alerts can I set up

Configure temperature alerts (CPU above 75°C, inlet above 30°C), fan speed alerts (fan failure when RPM drops to zero), power supply failure alerts, and System Event Log monitoring for hardware errors. Set thresholds in Settings > Alerts and configure notification methods.

What IPMI systems does ServerScout support

ServerScout supports Dell DRAC, HP/HPE iLO, IBM IMM, and Supermicro IPMI systems. All these manufacturer implementations use the standardized IPMI protocol, allowing ServerScout to communicate with them through the same 'drac' device type using ipmitool.

How often should I monitor server hardware with IPMI

ServerScout recommends monitoring intervals of 5-10 minutes for hardware metrics collection. This frequency provides timely detection of hardware issues without overwhelming the BMC. You should also review System Event Logs weekly for patterns and establish baselines for normal operating ranges.

Was this article helpful?