Server hardware monitoring is crucial for maintaining system reliability and preventing costly downtime. With Server Scout's IPMI integration, you can monitor critical hardware metrics like temperatures, fan speeds, and power consumption directly from your server's Baseboard Management Controller (BMC).
What is IPMI?
Intelligent Platform Management Interface (IPMI) is an industry standard that allows you to monitor and manage server hardware independently of the operating system. Different manufacturers use various implementations:
- Dell: DRAC (Dell Remote Access Controller)
- HP/HPE: iLO (Integrated Lights-Out)
- IBM: IMM (Integrated Management Module)
- Supermicro: IPMI
Server Scout uses the standardised IPMI protocol to communicate with all these systems through the ipmitool utility.
Adding an IPMI Device
To begin monitoring server hardware, you'll need to add a device with the drac type in Server Scout.
- Navigate to your Server Scout dashboard
- Click "Add Device"
- Select device type: drac
- Configure the following settings:
- Device Name: A descriptive name for your server - BMC IP Address: The IP address of your server's management interface - IPMI Username: BMC username (often root or admin) - IPMI Password: Corresponding password - Monitoring Interval: How frequently to collect metrics (recommended: 5-10 minutes)
# Test IPMI connectivity manually
ipmitool -I lanplus -H <BMC_IP> -U <username> -P <password> sensor list
Hardware Metrics Collected
Server Scout's devices plugin uses ipmitool to gather comprehensive hardware telemetry:
Temperature Monitoring
- CPU temperatures: Individual processor core temperatures
- Inlet temperature: Ambient air temperature entering the chassis
- Exhaust temperature: Hot air temperature leaving the system
- Memory temperatures: DIMM temperature sensors where available
Fan Speed Monitoring
- System fans: Chassis cooling fans (measured in RPM)
- CPU fans: Processor cooling fans
- Power supply fans: PSU cooling fans
Power Monitoring
- Total power consumption: Current power draw in watts
- Power supply status: Health status of each PSU
- Voltage rails: 12V, 5V, and 3.3V rail monitoring
System Health
- Chassis intrusion detection: Physical tampering alerts
- System Event Log (SEL) entries: Hardware error events and warnings
Viewing Hardware Health
Once configured, hardware metrics appear on the device detail page:
- Click on your IPMI-enabled device from the dashboard
- Navigate to the Hardware tab
- Review current readings and historical graphs
The interface displays:
- Real-time sensor readings with colour-coded status indicators
- Historical trend graphs for temperature and power consumption
- Recent SEL entries with timestamps and severity levels
- Fan speed charts showing cooling system performance
Setting Up Hardware Alerts
Server Scout can automatically alert you when hardware issues arise:
Temperature Alerts
Configure thresholds for critical components:
- CPU temperature: Alert above 75°C
- Inlet temperature: Warning above 30°C
- Exhaust temperature: Alert above 65°C
Fan Speed Alerts
Monitor cooling system health:
- Fan failure: Alert when RPM drops to zero
- Reduced airflow: Warning when fans operate below 50% speed
Power Supply Alerts
Ensure power redundancy:
- PSU failure: Immediate alert on power supply faults
- Power consumption: Warning when approaching capacity limits
SEL Monitoring
Critical for identifying hardware problems:
- New error entries: Alert on correctable and uncorrectable errors
- Predictive failure: Early warning for components showing signs of failure
To configure alerts:
- Go to Settings > Alerts
- Select your IPMI device
- Enable desired alert types
- Set appropriate thresholds
- Configure notification methods (email, webhook, etc.)
Best Practices
- Regular SEL review: Check system event logs weekly for patterns
- Baseline establishment: Monitor normal operating ranges for your environment
- Redundancy verification: Ensure alerts trigger when redundant components fail
- Environmental correlation: Compare inlet temperatures with room conditions
IPMI monitoring provides invaluable insight into your server's physical health, helping prevent hardware failures before they impact service availability. With Server Scout's comprehensive hardware monitoring, you can maintain optimal system performance and extend equipment lifespan.
Frequently Asked Questions
How do I set up IPMI monitoring in ServerScout
What hardware metrics does IPMI monitoring collect
How does IPMI work for server monitoring
IPMI monitoring not working troubleshooting
What IPMI hardware alerts can I set up
What IPMI systems does ServerScout support
How often should I monitor server hardware with IPMI
Was this article helpful?