diff mbox series

[net-next] net/mlx4_en: optimizes get_fixed_ipv6_csum()

Message ID 20180419154929.25718-1-edumazet@google.com
State Accepted, archived
Delegated to: David Miller
Headers show
Series [net-next] net/mlx4_en: optimizes get_fixed_ipv6_csum() | expand

Commit Message

Eric Dumazet April 19, 2018, 3:49 p.m. UTC
While trying to support CHECKSUM_COMPLETE for IPV6 fragments,
I had to experiments various hacks in get_fixed_ipv6_csum().
I must admit I could not find how to implement this :/

However, get_fixed_ipv6_csum() does a lot of redundant operations,
calling csum_partial() twice.

First csum_partial() computes the checksum of saddr and daddr,
put in @csum_pseudo_hdr. Undone later in the second csum_partial()
computed on whole ipv6 header.

Then nexthdr is added once, added a second time, then substracted.

payload_len is added once, then substracted.

Really all this can be reduced to two add_csum(), to add back 6 bytes
that were removed by mlx4 when providing hw_checksum in RX descriptor.

Signed-off-by: Eric Dumazet <edumazet@google.com>
Cc: Saeed Mahameed <saeedm@mellanox.com>
Cc: Tariq Toukan <tariqt@mellanox.com>
---
Note: This patch, like other mlx4 patches can definitely wait
Tariq approval, thanks !

 drivers/net/ethernet/mellanox/mlx4/en_rx.c | 21 ++++++++-------------
 1 file changed, 8 insertions(+), 13 deletions(-)

Comments

Saeed Mahameed April 26, 2018, 10:56 p.m. UTC | #1
On Thu, 2018-04-19 at 08:49 -0700, Eric Dumazet wrote:
> While trying to support CHECKSUM_COMPLETE for IPV6 fragments,
> I had to experiments various hacks in get_fixed_ipv6_csum().
> I must admit I could not find how to implement this :/
> 
> However, get_fixed_ipv6_csum() does a lot of redundant operations,
> calling csum_partial() twice.
> 
> First csum_partial() computes the checksum of saddr and daddr,
> put in @csum_pseudo_hdr. Undone later in the second csum_partial()
> computed on whole ipv6 header.
> 
> Then nexthdr is added once, added a second time, then substracted.
> 
> payload_len is added once, then substracted.
> 
> Really all this can be reduced to two add_csum(), to add back 6 bytes
> that were removed by mlx4 when providing hw_checksum in RX
> descriptor.
> 
> Signed-off-by: Eric Dumazet <edumazet@google.com>
> Cc: Saeed Mahameed <saeedm@mellanox.com>
> Cc: Tariq Toukan <tariqt@mellanox.com>
> ---
> Note: This patch, like other mlx4 patches can definitely wait
> Tariq approval, thanks !
> 

LGTM,

Reviewed-by: Saeed Mahameed <saeedm@mellanox.com>

>  drivers/net/ethernet/mellanox/mlx4/en_rx.c | 21 ++++++++----------
> ---
>  1 file changed, 8 insertions(+), 13 deletions(-)
> 
> diff --git a/drivers/net/ethernet/mellanox/mlx4/en_rx.c
> b/drivers/net/ethernet/mellanox/mlx4/en_rx.c
> index
> 5c613c6663da51a4ae792eeb4d8956b54655786b..38c56fb6e5f5970f245dd56c38e
> 1fc63a9349a07 100644
> --- a/drivers/net/ethernet/mellanox/mlx4/en_rx.c
> +++ b/drivers/net/ethernet/mellanox/mlx4/en_rx.c
> @@ -593,30 +593,25 @@ static int get_fixed_ipv4_csum(__wsum
> hw_checksum, struct sk_buff *skb,
>  }
>  
>  #if IS_ENABLED(CONFIG_IPV6)
> -/* In IPv6 packets, besides subtracting the pseudo header checksum,
> - * we also compute/add the IP header checksum which
> - * is not added by the HW.
> +/* In IPv6 packets, hw_checksum lacks 6 bytes from IPv6 header:
> + * 4 first bytes : priority, version, flow_lbl
> + * and 2 additional bytes : nexthdr, hop_limit.
>   */
>  static int get_fixed_ipv6_csum(__wsum hw_checksum, struct sk_buff
> *skb,
>  			       struct ipv6hdr *ipv6h)
>  {
>  	__u8 nexthdr = ipv6h->nexthdr;
> -	__wsum csum_pseudo_hdr = 0;
> +	__wsum temp;
>  
>  	if (unlikely(nexthdr == IPPROTO_FRAGMENT ||
>  		     nexthdr == IPPROTO_HOPOPTS ||
>  		     nexthdr == IPPROTO_SCTP))
>  		return -1;
> -	hw_checksum = csum_add(hw_checksum, (__force
> __wsum)htons(nexthdr));
>  
> -	csum_pseudo_hdr = csum_partial(&ipv6h->saddr,
> -				       sizeof(ipv6h->saddr) +
> sizeof(ipv6h->daddr), 0);
> -	csum_pseudo_hdr = csum_add(csum_pseudo_hdr, (__force
> __wsum)ipv6h->payload_len);
> -	csum_pseudo_hdr = csum_add(csum_pseudo_hdr,
> -				   (__force __wsum)htons(nexthdr));
> -
> -	skb->csum = csum_sub(hw_checksum, csum_pseudo_hdr);
> -	skb->csum = csum_add(skb->csum, csum_partial(ipv6h,
> sizeof(struct ipv6hdr), 0));
> +	/* priority, version, flow_lbl */
> +	temp = csum_add(hw_checksum, *(__wsum *)ipv6h);
> +	/* nexthdr and hop_limit */
> +	skb->csum = csum_add(temp, (__force __wsum)*(__be16
> *)&ipv6h->nexthdr);
>  	return 0;
>  }
>  #endif
Tariq Toukan May 2, 2018, 2:18 p.m. UTC | #2
On 27/04/2018 1:56 AM, Saeed Mahameed wrote:
> On Thu, 2018-04-19 at 08:49 -0700, Eric Dumazet wrote:
>> While trying to support CHECKSUM_COMPLETE for IPV6 fragments,
>> I had to experiments various hacks in get_fixed_ipv6_csum().
>> I must admit I could not find how to implement this :/
>>
>> However, get_fixed_ipv6_csum() does a lot of redundant operations,
>> calling csum_partial() twice.
>>
>> First csum_partial() computes the checksum of saddr and daddr,
>> put in @csum_pseudo_hdr. Undone later in the second csum_partial()
>> computed on whole ipv6 header.
>>
>> Then nexthdr is added once, added a second time, then substracted.
>>
>> payload_len is added once, then substracted.
>>
>> Really all this can be reduced to two add_csum(), to add back 6 bytes
>> that were removed by mlx4 when providing hw_checksum in RX
>> descriptor.
>>
>> Signed-off-by: Eric Dumazet <edumazet@google.com>
>> Cc: Saeed Mahameed <saeedm@mellanox.com>
>> Cc: Tariq Toukan <tariqt@mellanox.com>
>> ---
>> Note: This patch, like other mlx4 patches can definitely wait
>> Tariq approval, thanks !
>>
> 
> LGTM,
> 
> Reviewed-by: Saeed Mahameed <saeedm@mellanox.com>
> 

Acked-by: Tariq Toukan <tariqt@mellanox.com>

Thanks Eric.

>>   drivers/net/ethernet/mellanox/mlx4/en_rx.c | 21 ++++++++----------
>> ---
>>   1 file changed, 8 insertions(+), 13 deletions(-)
>>
>> diff --git a/drivers/net/ethernet/mellanox/mlx4/en_rx.c
>> b/drivers/net/ethernet/mellanox/mlx4/en_rx.c
>> index
>> 5c613c6663da51a4ae792eeb4d8956b54655786b..38c56fb6e5f5970f245dd56c38e
>> 1fc63a9349a07 100644
>> --- a/drivers/net/ethernet/mellanox/mlx4/en_rx.c
>> +++ b/drivers/net/ethernet/mellanox/mlx4/en_rx.c
>> @@ -593,30 +593,25 @@ static int get_fixed_ipv4_csum(__wsum
>> hw_checksum, struct sk_buff *skb,
>>   }
>>   
>>   #if IS_ENABLED(CONFIG_IPV6)
>> -/* In IPv6 packets, besides subtracting the pseudo header checksum,
>> - * we also compute/add the IP header checksum which
>> - * is not added by the HW.
>> +/* In IPv6 packets, hw_checksum lacks 6 bytes from IPv6 header:
>> + * 4 first bytes : priority, version, flow_lbl
>> + * and 2 additional bytes : nexthdr, hop_limit.
>>    */
>>   static int get_fixed_ipv6_csum(__wsum hw_checksum, struct sk_buff
>> *skb,
>>   			       struct ipv6hdr *ipv6h)
>>   {
>>   	__u8 nexthdr = ipv6h->nexthdr;
>> -	__wsum csum_pseudo_hdr = 0;
>> +	__wsum temp;
>>   
>>   	if (unlikely(nexthdr == IPPROTO_FRAGMENT ||
>>   		     nexthdr == IPPROTO_HOPOPTS ||
>>   		     nexthdr == IPPROTO_SCTP))
>>   		return -1;
>> -	hw_checksum = csum_add(hw_checksum, (__force
>> __wsum)htons(nexthdr));
>>   
>> -	csum_pseudo_hdr = csum_partial(&ipv6h->saddr,
>> -				       sizeof(ipv6h->saddr) +
>> sizeof(ipv6h->daddr), 0);
>> -	csum_pseudo_hdr = csum_add(csum_pseudo_hdr, (__force
>> __wsum)ipv6h->payload_len);
>> -	csum_pseudo_hdr = csum_add(csum_pseudo_hdr,
>> -				   (__force __wsum)htons(nexthdr));
>> -
>> -	skb->csum = csum_sub(hw_checksum, csum_pseudo_hdr);
>> -	skb->csum = csum_add(skb->csum, csum_partial(ipv6h,
>> sizeof(struct ipv6hdr), 0));
>> +	/* priority, version, flow_lbl */
>> +	temp = csum_add(hw_checksum, *(__wsum *)ipv6h);
>> +	/* nexthdr and hop_limit */
>> +	skb->csum = csum_add(temp, (__force __wsum)*(__be16
>> *)&ipv6h->nexthdr);
>>   	return 0;
>>   }
>>   #endif
Eric Dumazet May 4, 2018, 12:05 a.m. UTC | #3
On 05/02/2018 07:18 AM, Tariq Toukan wrote:
> 
> 
> On 27/04/2018 1:56 AM, Saeed Mahameed wrote:

>> LGTM,
>>
>> Reviewed-by: Saeed Mahameed <saeedm@mellanox.com>
>>
> 
> Acked-by: Tariq Toukan <tariqt@mellanox.com>
> 
> Thanks Eric.

Thanks guys. 

I see this patch  ( http://patchwork.ozlabs.org/patch/901336/ ) in
a state I do not know : "Awaiting Upstream"
David Miller May 4, 2018, 1:52 a.m. UTC | #4
From: Eric Dumazet <eric.dumazet@gmail.com>
Date: Thu, 3 May 2018 17:05:06 -0700

> 
> 
> On 05/02/2018 07:18 AM, Tariq Toukan wrote:
>> 
>> 
>> On 27/04/2018 1:56 AM, Saeed Mahameed wrote:
> 
>>> LGTM,
>>>
>>> Reviewed-by: Saeed Mahameed <saeedm@mellanox.com>
>>>
>> 
>> Acked-by: Tariq Toukan <tariqt@mellanox.com>
>> 
>> Thanks Eric.
> 
> Thanks guys. 
> 
> I see this patch  ( http://patchwork.ozlabs.org/patch/901336/ ) in
> a state I do not know : "Awaiting Upstream"

THat means I expect to see this change from the upstream
maintainer, which in this case is Tariq.
Eric Dumazet May 4, 2018, 2:10 a.m. UTC | #5
On 05/03/2018 06:52 PM, David Miller wrote:
> From: Eric Dumazet <eric.dumazet@gmail.com>
> Date: Thu, 3 May 2018 17:05:06 -0700
> 
>>
>>
>> On 05/02/2018 07:18 AM, Tariq Toukan wrote:
>>>
>>>
>>> On 27/04/2018 1:56 AM, Saeed Mahameed wrote:
>>
>>>> LGTM,
>>>>
>>>> Reviewed-by: Saeed Mahameed <saeedm@mellanox.com>
>>>>
>>>
>>> Acked-by: Tariq Toukan <tariqt@mellanox.com>
>>>
>>> Thanks Eric.
>>
>> Thanks guys. 
>>
>> I see this patch  ( http://patchwork.ozlabs.org/patch/901336/ ) in
>> a state I do not know : "Awaiting Upstream"
> 
> THat means I expect to see this change from the upstream
> maintainer, which in this case is Tariq.
> 

I see, but it seems Tariq does not know that, otherwise he would
not have sent an "Acked-by:"

I guess this will need an extra round-trip ...
David Miller May 4, 2018, 3:59 p.m. UTC | #6
From: Eric Dumazet <eric.dumazet@gmail.com>
Date: Thu, 3 May 2018 19:10:29 -0700

> 
> 
> On 05/03/2018 06:52 PM, David Miller wrote:
>> From: Eric Dumazet <eric.dumazet@gmail.com>
>> Date: Thu, 3 May 2018 17:05:06 -0700
>> 
>>>
>>>
>>> On 05/02/2018 07:18 AM, Tariq Toukan wrote:
>>>>
>>>>
>>>> On 27/04/2018 1:56 AM, Saeed Mahameed wrote:
>>>
>>>>> LGTM,
>>>>>
>>>>> Reviewed-by: Saeed Mahameed <saeedm@mellanox.com>
>>>>>
>>>>
>>>> Acked-by: Tariq Toukan <tariqt@mellanox.com>
>>>>
>>>> Thanks Eric.
>>>
>>> Thanks guys. 
>>>
>>> I see this patch  ( http://patchwork.ozlabs.org/patch/901336/ ) in
>>> a state I do not know : "Awaiting Upstream"
>> 
>> THat means I expect to see this change from the upstream
>> maintainer, which in this case is Tariq.
>> 
> 
> I see, but it seems Tariq does not know that, otherwise he would
> not have sent an "Acked-by:"
> 
> I guess this will need an extra round-trip ...

No need to extra round-trip, I applied it directly ;)
Eric Dumazet May 4, 2018, 4:10 p.m. UTC | #7
On 05/04/2018 08:59 AM, David Miller wrote:

> 
> No need to extra round-trip, I applied it directly ;)
> 

Very nice, thanks David !
diff mbox series

Patch

diff --git a/drivers/net/ethernet/mellanox/mlx4/en_rx.c b/drivers/net/ethernet/mellanox/mlx4/en_rx.c
index 5c613c6663da51a4ae792eeb4d8956b54655786b..38c56fb6e5f5970f245dd56c38e1fc63a9349a07 100644
--- a/drivers/net/ethernet/mellanox/mlx4/en_rx.c
+++ b/drivers/net/ethernet/mellanox/mlx4/en_rx.c
@@ -593,30 +593,25 @@  static int get_fixed_ipv4_csum(__wsum hw_checksum, struct sk_buff *skb,
 }
 
 #if IS_ENABLED(CONFIG_IPV6)
-/* In IPv6 packets, besides subtracting the pseudo header checksum,
- * we also compute/add the IP header checksum which
- * is not added by the HW.
+/* In IPv6 packets, hw_checksum lacks 6 bytes from IPv6 header:
+ * 4 first bytes : priority, version, flow_lbl
+ * and 2 additional bytes : nexthdr, hop_limit.
  */
 static int get_fixed_ipv6_csum(__wsum hw_checksum, struct sk_buff *skb,
 			       struct ipv6hdr *ipv6h)
 {
 	__u8 nexthdr = ipv6h->nexthdr;
-	__wsum csum_pseudo_hdr = 0;
+	__wsum temp;
 
 	if (unlikely(nexthdr == IPPROTO_FRAGMENT ||
 		     nexthdr == IPPROTO_HOPOPTS ||
 		     nexthdr == IPPROTO_SCTP))
 		return -1;
-	hw_checksum = csum_add(hw_checksum, (__force __wsum)htons(nexthdr));
 
-	csum_pseudo_hdr = csum_partial(&ipv6h->saddr,
-				       sizeof(ipv6h->saddr) + sizeof(ipv6h->daddr), 0);
-	csum_pseudo_hdr = csum_add(csum_pseudo_hdr, (__force __wsum)ipv6h->payload_len);
-	csum_pseudo_hdr = csum_add(csum_pseudo_hdr,
-				   (__force __wsum)htons(nexthdr));
-
-	skb->csum = csum_sub(hw_checksum, csum_pseudo_hdr);
-	skb->csum = csum_add(skb->csum, csum_partial(ipv6h, sizeof(struct ipv6hdr), 0));
+	/* priority, version, flow_lbl */
+	temp = csum_add(hw_checksum, *(__wsum *)ipv6h);
+	/* nexthdr and hop_limit */
+	skb->csum = csum_add(temp, (__force __wsum)*(__be16 *)&ipv6h->nexthdr);
 	return 0;
 }
 #endif