mbox series

[SRU,OEM-5.6,0/3] Prevent thermal shutdown during boot process

Message ID 20201129173506.738561-1-kai.heng.feng@canonical.com
Headers show
Series Prevent thermal shutdown during boot process | expand

Message

Kai-Heng Feng Nov. 29, 2020, 5:35 p.m. UTC
BugLink: https://bugs.launchpad.net/bugs/1906168

[Impact]
Surprising thermal shutdown at boot on Intel based mobile workstations.

[Fix]
Since these thermal devcies are not in ACPI ThermalZone, OS shouldn't
shutdown the system.

These critial temperatures are for usespace to handle, so let kernel
know it shouldn't handle it.

SRU for stable kernels will be sent after the fix is in upstream.

[Test]
Use reboot stress as a reproducer. 5% chance to see a surprising
shutdown at boot.

With the fix applied, the thermal shutdown is no longer reproducible.

[Where problems could occur]
For ACPI based platforms, we still have "acpitz" to protect systems from
overheating. If these acpitz sensors don't work, then the system could
face real overheating issue.

Kai-Heng Feng (3):
  UBUNTU: SAUCE: thermal: core: Add indication for userspace usage
  UBUNTU: SAUCE: thermal: int340x: Indicate userspace usage
  UBUNTU: SAUCE: thermal: intel: intel_pch_thermal: Indicate userspace
    usage

 drivers/thermal/intel/int340x_thermal/int3400_thermal.c     | 1 +
 .../thermal/intel/int340x_thermal/int340x_thermal_zone.c    | 1 +
 drivers/thermal/intel/intel_pch_thermal.c                   | 6 +++++-
 drivers/thermal/thermal_core.c                              | 3 +++
 include/linux/thermal.h                                     | 2 ++
 5 files changed, 12 insertions(+), 1 deletion(-)

Comments

AceLan Kao Dec. 2, 2020, 6:19 a.m. UTC | #1
Acked-By: AceLan Kao <acelan.kao@canonical.com>
Timo Aaltonen Dec. 2, 2020, 6:50 a.m. UTC | #2
On 29.11.2020 19.35, Kai-Heng Feng wrote:
> BugLink: https://bugs.launchpad.net/bugs/1906168
> 
> [Impact]
> Surprising thermal shutdown at boot on Intel based mobile workstations.
> 
> [Fix]
> Since these thermal devcies are not in ACPI ThermalZone, OS shouldn't
> shutdown the system.
> 
> These critial temperatures are for usespace to handle, so let kernel
> know it shouldn't handle it.
> 
> SRU for stable kernels will be sent after the fix is in upstream.
> 
> [Test]
> Use reboot stress as a reproducer. 5% chance to see a surprising
> shutdown at boot.
> 
> With the fix applied, the thermal shutdown is no longer reproducible.
> 
> [Where problems could occur]
> For ACPI based platforms, we still have "acpitz" to protect systems from
> overheating. If these acpitz sensors don't work, then the system could
> face real overheating issue.
> 
> Kai-Heng Feng (3):
>    UBUNTU: SAUCE: thermal: core: Add indication for userspace usage
>    UBUNTU: SAUCE: thermal: int340x: Indicate userspace usage
>    UBUNTU: SAUCE: thermal: intel: intel_pch_thermal: Indicate userspace
>      usage
> 
>   drivers/thermal/intel/int340x_thermal/int3400_thermal.c     | 1 +
>   .../thermal/intel/int340x_thermal/int340x_thermal_zone.c    | 1 +
>   drivers/thermal/intel/intel_pch_thermal.c                   | 6 +++++-
>   drivers/thermal/thermal_core.c                              | 3 +++
>   include/linux/thermal.h                                     | 2 ++
>   5 files changed, 12 insertions(+), 1 deletion(-)
> 

applied to oem-5.6, thanks