skb: Propagate pfmemalloc on skb from head page only

From: Eric Dumazet <eric.dumazet@gmail.com>

Hi.

I'm trying to send big chunks of memory from application address space via
a TCP socket using vmsplice + splice, like this:

   mem = mmap(128Mb);
   vmsplice(pipe[1], mem); /* splice memory into pipe */
   splice(pipe[0], tcp_socket); /* send it into network */

When I'm lucky and a huge page gets spliced into the pipe and then into the
socket _and_ the client and server ends of the TCP connection are on the same
host, communicating via lo, the whole connection gets stuck! The send queue
fills up and the app stops writing/splicing more into it, but the receive
queue remains empty. Here is why.

__skb_fill_page_desc() sees a tail page of a huge page and erroneously
propagates its page->pfmemalloc value onto the skb (the pfmemalloc field on
tail pages contains garbage). This skb->pfmemalloc then leaks through lo and,
due to the

    tcp_v4_rcv
        sk_filter
            if (skb->pfmemalloc && !sock_flag(sk, SOCK_MEMALLOC)) /* true */
                return -ENOMEM
        goto release_and_discard;

no packets reach the socket. Even TCP re-transmits are dropped by this, as skb
cloning clones the pfmemalloc flag as well.
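
To make the drop easier to see, here is a rough userspace model of it. This is
only a sketch: struct model_sock, struct model_skb and the model_* functions
are hypothetical stand-ins I made up for illustration, not kernel code, and
they keep nothing but the one flag that matters here.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>

/* Hypothetical stand-ins for struct sock and struct sk_buff. */
struct model_sock { bool memalloc; };    /* models sock_flag(sk, SOCK_MEMALLOC) */
struct model_skb  { bool pfmemalloc; };  /* models skb->pfmemalloc */

/* Models the check quoted above: a pfmemalloc skb is refused by any
 * socket that is not marked SOCK_MEMALLOC. */
int model_sk_filter(const struct model_sock *sk, const struct model_skb *skb)
{
    if (skb->pfmemalloc && !sk->memalloc)
        return -ENOMEM;
    return 0;
}

/* Models skb cloning: the flag is copied along with everything else. */
struct model_skb model_skb_clone(const struct model_skb *skb)
{
    return *skb;
}

/* Sends the queued skb `attempts` times (the first transmit plus
 * re-transmits), each time as a clone, and counts how many the
 * receiving socket accepts. */
int model_deliver(const struct model_sock *sk,
                  const struct model_skb *queued, int attempts)
{
    int accepted = 0;

    assert(attempts >= 0);
    for (int i = 0; i < attempts; i++) {
        struct model_skb clone = model_skb_clone(&queued[0]);

        if (model_sk_filter(sk, &clone) == 0)
            accepted++;
    }
    return accepted;
}
```

In this model, calling model_deliver() with a queued skb whose pfmemalloc is set
and a receiver without memalloc accepts nothing, however many attempts are
made, which is the stuck connection described above.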

That said, here's the proper page->pfmemalloc propagation onto the skb: we
must check the huge page's head page only, since the pfmemalloc and mapping
values of the other pages do not contain what is expected in this place.
However, I'm not sure whether this fix is _complete_, since pfmemalloc
propagation via lo also doesn't look great.
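
The idea can be sketched in userspace as follows. This is a model under stated
assumptions, not the patch itself: struct model_page, its head pointer and the
model_* functions are hypothetical names, and the "pfmemalloc set and no
mapping" condition is my assumption about what the existing helper tests. The
only point is that the flag is read after resolving the page to its head page.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for struct page. */
struct model_page {
    struct model_page *head;  /* NULL for a head (or order-0) page */
    bool pfmemalloc;          /* only meaningful on the head page */
    void *mapping;            /* only meaningful on the head page */
};

/* Hypothetical stand-in for struct sk_buff. */
struct model_skb { bool pfmemalloc; };

/* Resolve any page of a huge page to its head page. */
struct model_page *model_compound_head(struct model_page *page)
{
    return page->head ? page->head : page;
}

/* Propagate pfmemalloc onto the skb from the head page only, so garbage
 * in a tail page's fields is never looked at. */
void model_fill_page_desc(struct model_skb *skb, struct model_page *page)
{
    assert(skb != NULL && page != NULL);

    page = model_compound_head(page);
    if (page->pfmemalloc && !page->mapping)
        skb->pfmemalloc = true;
}
```

With a tail page whose own pfmemalloc field holds garbage (true) and a head
page whose field is false, model_fill_page_desc() leaves skb->pfmemalloc
clear; it gets set only when the head page really carries the flag.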

Both the bit propagation from page to skb and this check in sk_filter were
introduced by c48a11c7 (netvm: propagate page->pfmemalloc to skb) in v3.5, so
Mel and stable@ are in Cc.

Signed-off-by: Pavel Emelyanov <xemul@parallels.com>

---


Message ID	5141D0C4.70409@parallels.com
State	Accepted, archived
Delegated to:	David Miller
Headers
Message-ID: <5141D0C4.70409@parallels.com>
Date: Thu, 14 Mar 2013 17:29:40 +0400
From: Pavel Emelyanov <xemul@parallels.com>
To: David Miller <davem@davemloft.net>, Mel Gorman <mgorman@suse.de>, Eric Dumazet <eric.dumazet@gmail.com>, Linux Netdev List <netdev@vger.kernel.org>, stable@kernel.org
CC: Alexey Kuznetsov <kuznet@ms2.inr.ac.ru>
Subject: [PATCH] skb: Propagate pfmemalloc on skb from head page only
