Bug 359307

Summary: Samsung P35:Shutdown caused by critical temperature
Product: [openSUSE] openSUSE 10.3 Reporter: Christian Trippe <ctrippe>
Component: KernelAssignee: Thomas Renninger <trenn>
Status: RESOLVED INVALID QA Contact: E-mail List <qa-bugs>
Severity: Normal    
Priority: P5 - None    
Version: Final   
Target Milestone: ---   
Hardware: i686   
OS: openSUSE 10.3   
Whiteboard:
Found By: --- Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---

Description Christian Trippe 2008-02-06 19:40:46 UTC
I have a Samsung P35 and tried to compile an Kernel because of bug 331011. During the make-process the critical temperature is reached and the system is halt. This is reproducable, I tried twice.


uname -r
2.6.22.16-0.2-default


excerpt from /var/log/messages

Feb  6 19:46:59 notebook smartd[3220]: Device: /dev/sda, SMART Usage Attribute: 194 Temperature_Celsius changed from 100 to 94
Feb  6 19:53:13 notebook kernel: ACPI: Critical trip point
Feb  6 19:53:13 notebook kernel: Critical temperature reached (105 C), shutting down.
Feb  6 19:53:13 notebook kernel: Critical temperature reached (104 C), shutting down.
Feb  6 19:53:14 notebook shutdown[2552]: shutting down for system halt
Feb  6 19:53:15 notebook shutdown[2553]: shutting down for system halt
Feb  6 19:53:18 notebook init: Switching to runlevel: 0
Feb  6 19:53:25 notebook gconfd (christian-4430): Signal 15 erhalten, ordungsgemäßes Herunterfahren
Feb  6 19:53:25 notebook gconfd (christian-4430): Beenden
Feb  6 19:53:30 notebook smartd[3220]: smartd received signal 15: Terminated
Feb  6 19:53:30 notebook smartd[3220]: smartd is exiting (exit status 0)
Feb  6 19:53:30 notebook smpppd[3130]: terminating on signal 15
Feb  6 19:53:31 notebook auditd[2808]: The audit daemon is exiting.
Feb  6 19:53:31 notebook kernel: audit(1202324011.332:6): audit_pid=0 old=2808 by auid=4294967295
Feb  6 19:53:31 notebook kernel: ACPI: PCI interrupt for device 0000:00:1f.5 disabled
Feb  6 19:53:31 notebook kernel: ACPI: PCI interrupt for device 0000:00:1f.6 disabled
Feb  6 19:53:32 notebook sshd[3224]: Received signal 15; terminating.
Feb  6 19:53:33 notebook kernel: Kernel logging (proc) stopped.
Feb  6 19:53:33 notebook kernel: Kernel log daemon terminating.
Feb  6 19:53:33 notebook syslog-ng[2218]: syslog-ng version 1.6.12 going down
Comment 1 Thomas Renninger 2008-02-07 09:56:38 UTC
> smartd[3220]: Device: /dev/sda, SMART Usage Attribute:
> 194 Temperature_Celsius changed from 100 to 94
Does the smartd also read out temperature? Looks like the same temperature the ACPI thermal module reads. They may bite each other, confuse the thermal sensor and falsify the values.
Hmm, but the values still make sense (I'd expect unsane values like 3000 then).

Please, first make sure the fan slots are dust free!
Every second bug about thermal shutdowns is because of dirty fan slots.
Just using a strong vacuumer when the machine is switched off, could be enough. Sometimes you even have to open the machine to clean it.
Comment 2 Christian Trippe 2008-02-08 12:51:18 UTC
Thanks for the hint to clean the fan slots. This has solved the problem. I had also stopped the smartd, but I hope to see no different behaviour with running smartd.

I mark this bug as invalid.

Sorry for the noise.