{"id":2223759,"url":"http://patchwork.ozlabs.org/api/1.1/patches/2223759/?format=json","web_url":"http://patchwork.ozlabs.org/project/linux-pci/patch/20260416070707.3242381-1-yuan.gao@ucloud.cn/","project":{"id":28,"url":"http://patchwork.ozlabs.org/api/1.1/projects/28/?format=json","name":"Linux PCI development","link_name":"linux-pci","list_id":"linux-pci.vger.kernel.org","list_email":"linux-pci@vger.kernel.org","web_url":null,"scm_url":null,"webscm_url":null},"msgid":"<20260416070707.3242381-1-yuan.gao@ucloud.cn>","date":"2026-04-16T07:07:06","name":"PCI: Avoid FLR for NVIDIA 5090 GPU","commit_ref":null,"pull_url":null,"state":"new","archived":false,"hash":"0ddbb5b987884ac22673af63db3fae4cc23569c8","submitter":{"id":93167,"url":"http://patchwork.ozlabs.org/api/1.1/people/93167/?format=json","name":"yuan.gao","email":"yuan.gao@ucloud.cn"},"delegate":null,"mbox":"http://patchwork.ozlabs.org/project/linux-pci/patch/20260416070707.3242381-1-yuan.gao@ucloud.cn/mbox/","series":[{"id":500086,"url":"http://patchwork.ozlabs.org/api/1.1/series/500086/?format=json","web_url":"http://patchwork.ozlabs.org/project/linux-pci/list/?series=500086","date":"2026-04-16T07:07:06","name":"PCI: Avoid FLR for NVIDIA 5090 GPU","version":1,"mbox":"http://patchwork.ozlabs.org/series/500086/mbox/"}],"comments":"http://patchwork.ozlabs.org/api/patches/2223759/comments/","check":"pending","checks":"http://patchwork.ozlabs.org/api/patches/2223759/checks/","tags":{},"headers":{"Return-Path":"\n <linux-pci+bounces-52574-incoming=patchwork.ozlabs.org@vger.kernel.org>","X-Original-To":["incoming@patchwork.ozlabs.org","linux-pci@vger.kernel.org"],"Delivered-To":"patchwork-incoming@legolas.ozlabs.org","Authentication-Results":["legolas.ozlabs.org;\n\tdkim=pass (1024-bit key;\n unprotected) header.d=ucloud.cn header.i=@ucloud.cn header.a=rsa-sha256\n header.s=default header.b=SKBOGdYW;\n\tdkim-atps=neutral","legolas.ozlabs.org;\n spf=pass (sender SPF authorized) smtp.mailfrom=vger.kernel.org\n 
(client-ip=2600:3c0a:e001:db::12fc:5321; helo=sea.lore.kernel.org;\n envelope-from=linux-pci+bounces-52574-incoming=patchwork.ozlabs.org@vger.kernel.org;\n receiver=patchwork.ozlabs.org)","smtp.subspace.kernel.org;\n\tdkim=pass (1024-bit key) header.d=ucloud.cn header.i=@ucloud.cn\n header.b=\"SKBOGdYW\"","smtp.subspace.kernel.org;\n arc=none smtp.client-ip=45.254.49.209","smtp.subspace.kernel.org;\n dmarc=pass (p=quarantine dis=none) header.from=ucloud.cn","smtp.subspace.kernel.org;\n spf=pass smtp.mailfrom=ucloud.cn"],"Received":["from sea.lore.kernel.org (sea.lore.kernel.org\n [IPv6:2600:3c0a:e001:db::12fc:5321])\n\t(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)\n\t key-exchange x25519 server-signature ECDSA (secp384r1) server-digest SHA384)\n\t(No client certificate requested)\n\tby legolas.ozlabs.org (Postfix) with ESMTPS id 4fx8Pq1r7Gz1yCv\n\tfor <incoming@patchwork.ozlabs.org>; Thu, 16 Apr 2026 17:13:51 +1000 (AEST)","from smtp.subspace.kernel.org (conduit.subspace.kernel.org\n [100.90.174.1])\n\tby sea.lore.kernel.org (Postfix) with ESMTP id E08D9308018F\n\tfor <incoming@patchwork.ozlabs.org>; Thu, 16 Apr 2026 07:12:47 +0000 (UTC)","from localhost.localdomain (localhost.localdomain [127.0.0.1])\n\tby smtp.subspace.kernel.org (Postfix) with ESMTP id E545A381B0D;\n\tThu, 16 Apr 2026 07:12:46 +0000 (UTC)","from mail-m49209.qiye.163.com (mail-m49209.qiye.163.com\n [45.254.49.209])\n\t(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))\n\t(No client certificate requested)\n\tby smtp.subspace.kernel.org (Postfix) with ESMTPS id 29CAA37F00D;\n\tThu, 16 Apr 2026 07:12:43 +0000 (UTC)","from yuangap.. 
(unknown [106.75.220.2])\n\tby smtp.qiye.163.com (Hmail) with ESMTP id 18fc72fa3;\n\tThu, 16 Apr 2026 15:07:26 +0800 (GMT+08:00)"],"ARC-Seal":"i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116;\n\tt=1776323566; cv=none;\n b=EPMZhZni/ugRNMf2uGwjKMKhz7ZDEbAXlaoXMrq9k4iUnOijq09QbZD6/0Xbvtt4Ly0djrhe1QgSDJ4kLKdNF6P79vmHAHYg+iHM0dF2WlLHuc1CFG0FTVgd7JII9oWNiTO2i8pHsPa/Qa/owsq1zxhH5vPc6rPa0EwCtrCa+fY=","ARC-Message-Signature":"i=1; a=rsa-sha256; d=subspace.kernel.org;\n\ts=arc-20240116; t=1776323566; c=relaxed/simple;\n\tbh=DGoUtmfUyJA820hOjw4yi3q4y8EGWylhVe7SKv4s89k=;\n\th=From:To:Cc:Subject:Date:Message-Id:MIME-Version:Content-Type;\n b=El4ZAh96iquWjFFuYgriOypVYl3inc0FtvrAy/zzZk7zZIxkOhbaOAWlN2rKL9ZGYOR6eDltoDi93rYw5W0iN5PzwRlobF/eIdu4xKscYKxkjQNSfbRbJjnMUrwNt2UnMeiW6ChO9SooX7CAtyagHpOYQzOcGFqxwLUhTpwxN2g=","ARC-Authentication-Results":"i=1; smtp.subspace.kernel.org;\n dmarc=pass (p=quarantine dis=none) header.from=ucloud.cn;\n spf=pass smtp.mailfrom=ucloud.cn;\n dkim=pass (1024-bit key) header.d=ucloud.cn header.i=@ucloud.cn\n header.b=SKBOGdYW; arc=none smtp.client-ip=45.254.49.209","From":"\"yuan.gao\" <yuan.gao@ucloud.cn>","To":"Bjorn Helgaas <bhelgaas@google.com>,\n\tlinux-pci@vger.kernel.org,\n\tlinux-kernel@vger.kernel.org","Cc":"\"yuan.gao\" <yuan.gao@ucloud.cn>","Subject":"[PATCH] PCI: Avoid FLR for NVIDIA 5090 GPU","Date":"Thu, 16 Apr 2026 15:07:06 +0800","Message-Id":"<20260416070707.3242381-1-yuan.gao@ucloud.cn>","X-Mailer":"git-send-email 2.34.1","Precedence":"bulk","X-Mailing-List":"linux-pci@vger.kernel.org","List-Id":"<linux-pci.vger.kernel.org>","List-Subscribe":"<mailto:linux-pci+subscribe@vger.kernel.org>","List-Unsubscribe":"<mailto:linux-pci+unsubscribe@vger.kernel.org>","MIME-Version":"1.0","Content-Type":"text/plain; 
charset=UTF-8","Content-Transfer-Encoding":"8bit","X-HM-Tid":"0a9d951dba860229kunm87b7e6a7dff59","X-HM-MType":"1","X-HM-Spam-Status":"e1kfGhgUHx5ZQUpXWQgPGg8OCBgUHx5ZQUlOS1dZFg8aDwILHllBWSg2Ly\n\ttZV1koWUFJQjdXWS1ZQUlXWQ8JGhUIEh9ZQVlDTElMVkIdH0lMSx5CQxlKHVYVFAkWGhdVGRETFh\n\toSFyQUDg9ZV1kYEgtZQVlKS01VTE5VSUlLVUlZV1kWGg8SFR0UWUFZT0tIVUpLSEtKSE1VSktLVU\n\ttZBg++","DKIM-Signature":"a=rsa-sha256;\n\tb=SKBOGdYWTW2GqidaVwTyFFLXpmLMm0Rlr2hYPFhLYJNEV0Kz7eOK5FPeC0xM9pwJabXJVBtzT4lO503LySi2Zix+R5wRA1YkMVcGNpRJVU2jWcg3rjYgv9N9ob81yXrAcaMahrQ/m7eiqe6XnaK1YxZeulQ4WawIOK0/DAsXVps=;\n c=relaxed/relaxed; s=default; d=ucloud.cn; v=1;\n\tbh=sNStje91iAC4EO4tWOP9oKrQHdpAKriZjxYmfZ0ZIKE=;\n\th=date:mime-version:subject:message-id:from;"},"content":"When an NVIDIA 5090 GPU is passed through to a VM, an FLR timeout\noccasionally occurs during VM shutdown, which subsequently leads to\na soft lockup on the host CPU.\n\nThe same problem is described in this post\n(https://forum.level1techs.com/t/do-your-rtx-5090-or-general-rtx-50-series-has-reset-bug-in-vm-passthrough/228549).\n\nThe host dmesg shows:\n\n [401106.011979] vfio-pci 0000:d8:00.0: not ready 1023ms after FLR; waiting\n [401108.700074] vfio-pci 0000:d8:00.0: not ready 2047ms after FLR; waiting\n [401112.412204] vfio-pci 0000:d8:00.0: not ready 4095ms after FLR; waiting\n [401118.620399] vfio-pci 0000:d8:00.0: not ready 8191ms after FLR; waiting\n [401128.860788] vfio-pci 0000:d8:00.0: not ready 16383ms after FLR; waiting\n [401147.293518] vfio-pci 0000:d8:00.0: not ready 32767ms after FLR; waiting\n [401185.694859] vfio-pci 0000:d8:00.0: not ready 65535ms after FLR; giving up\n [401195.372583] vfio-pci 0000:38:00.2: Relaying device request to user (#0)\n\n [401208.274941] watchdog: BUG: soft lockup - CPU#11 stuck for 21s! 
[CPU 22/KVM:30337]\n\n [401209.887848] CPU: 11 PID: 30337 Comm: CPU 22/KVM Kdump: loaded Not tainted\n [401209.887854] RIP: 0010:pci_mmcfg_read+0xaa/0xd0\n\n [401209.887866] Call Trace:\n [401209.887872]  pci_bus_read_config_dword+0x43/0x70\n [401209.887876]  pci_find_next_ext_capability.part.20+0x65/0xc0\n [401209.887879]  pci_restore_state.part.39+0x6d/0x3f0\n [401209.887883]  vfio_pci_disable+0x22b/0x4d0 [vfio_pci]\n [401209.887886]  ? __dentry_kill+0x118/0x160\n [401209.887888]  vfio_pci_release+0x5a/0xb0 [vfio_pci]\n [401209.887891]  vfio_device_fops_release+0x18/0x30 [vfio]\n [401209.887894]  __fput+0x98/0x240\n [401209.887897]  task_work_run+0x6a/0xa0\n [401209.887899]  do_exit+0x375/0xb10\n [401209.887900]  do_group_exit+0x3a/0xa0\n [401209.887902]  get_signal+0x140/0x7d0\n [401209.887906]  arch_do_signal+0x2c/0x260\n [401209.887909]  exit_to_user_mode_prepare+0xc0/0x120\n [401209.887912]  syscall_exit_to_user_mode+0x27/0x180\n [401209.887915]  entry_SYSCALL_64_after_hwframe+0x44/0xa9\n\nFLR appears to be unreliable on the NVIDIA 5090 GPU,\nso add quirk_no_flr entries for these devices.\n\nAnd with this patch in place, the host kernel doesn't exhibit these\nproblems. 
The VM starts up and works as expected with the passed-through\nNVIDIA 5090 GPU.\n\nSigned-off-by: yuan.gao <yuan.gao@ucloud.cn>\n---\n drivers/pci/quirks.c | 3 +++\n 1 file changed, 3 insertions(+)","diff":"diff --git a/drivers/pci/quirks.c b/drivers/pci/quirks.c\nindex 48946cca4be72..71f833f3e2d84 100644\n--- a/drivers/pci/quirks.c\n+++ b/drivers/pci/quirks.c\n@@ -5618,6 +5618,9 @@ DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_AMD, 0x7901, quirk_no_flr);\n DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_INTEL, 0x1502, quirk_no_flr);\n DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_INTEL, 0x1503, quirk_no_flr);\n DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_MEDIATEK, 0x0616, quirk_no_flr);\n+DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_NVIDIA, 0x2b85, quirk_no_flr);\n+DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_NVIDIA, 0x2b87, quirk_no_flr);\n+DECLARE_PCI_FIXUP_EARLY(PCI_VENDOR_ID_NVIDIA, 0x2b8c, quirk_no_flr);\n \n /* FLR may cause the SolidRun SNET DPU (rev 0x1) to hang */\n static void quirk_no_flr_snet(struct pci_dev *dev)\n","prefixes":[]}