[ovs-dev] ovn: Add second ACL stage

Message ID	1469767624-20966-1-git-send-email-mickeys.dev@gmail.com
State	Superseded
Headers	show Return-Path: <dev-bounces@openvswitch.org> Received-SPF: pass (mx3-pf2.cudamail.com: SPF record at _netblocks.google.com designates 209.85.220.67 as permitted sender) From: Mickey Spiegel <mickeys.dev@gmail.com> To: dev@openvswitch.org Date: Thu, 28 Jul 2016 21:47:04 -0700 ovn: Add second ACL stage Message-Id: <1469767624-20966-1-git-send-email-mickeys.dev@gmail.com> Mail: message has a signature 0.10 RDNS_NONE Delivered to trusted network by a host with no rDNS 0.50 BSF_SC5_MJ1963 Custom Rule MJ1963 Subject: [ovs-dev] [PATCH] ovn: Add second ACL stage Precedence: list MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Errors-To: dev-bounces@openvswitch.org Sender: "dev" <dev-bounces@openvswitch.org>

Mickey Spiegel July 29, 2016, 4:47 a.m. UTC

From: Mickey Spiegel <emspiege@us.ibm.com>

This patch adds a second logical switch ingress ACL stage, and
correspondingly a second logical switch egress ACL stage.  This
allows for more than one ACL-based feature to be applied in the
ingress and egress logical switch pipelines.  The features
driving the different ACL stages may be configured by different
users, for example an application deployer managing security
groups and a network or security admin configuring network ACLs
or firewall rules.

Each ACL stage is self contained.  The "action" for the
highest-"priority" matching row in an ACL stage determines a
packet's treatment.  A separate "action" will be determined in
each ACL stage, according to the ACL rules configured for that
ACL stage.  The "priority" values are only relevant within the
context of an ACL stage.

ACL rules that do not specify an ACL stage are applied to the
default "acl" stage.

Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
---
 ovn/northd/ovn-northd.c   | 319 +++++++++++++++++++++++++++-------------------
 ovn/ovn-nb.ovsschema      |   7 +-
 ovn/ovn-nb.xml            |  25 ++++
 ovn/utilities/ovn-nbctl.c |  35 +++--
 tests/ovn-nbctl.at        |  30 +++--
 5 files changed, 264 insertions(+), 152 deletions(-)

Russell Bryant July 29, 2016, 5:01 p.m. UTC | #1

On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <mickeys.dev@gmail.com>
wrote:

>
> This patch adds a second logical switch ingress ACL stage, and
> correspondingly a second logical switch egress ACL stage.  This
> allows for more than one ACL-based feature to be applied in the
> ingress and egress logical switch pipelines.  The features
> driving the different ACL stages may be configured by different
> users, for example an application deployer managing security
> groups and a network or security admin configuring network ACLs
> or firewall rules.
>
> Each ACL stage is self contained.  The "action" for the
> highest-"priority" matching row in an ACL stage determines a
> packet's treatment.  A separate "action" will be determined in
> each ACL stage, according to the ACL rules configured for that
> ACL stage.  The "priority" values are only relevant within the
> context of an ACL stage.
>
> ACL rules that do not specify an ACL stage are applied to the
> default "acl" stage.
>
> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>


Could you expand on why priorities in a single stage aren't enough to
satisfy the use case?

Mickey Spiegel July 29, 2016, 5:28 p.m. UTC | #2

-----"dev" <dev-bounces@openvswitch.org> wrote: -----
To: Mickey Spiegel <mickeys.dev@gmail.com>
From: Russell Bryant 
Sent by: "dev" 
Date: 07/29/2016 10:02AM
Cc: ovs dev <dev@openvswitch.org>
Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage

On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <mickeys.dev@gmail.com>
wrote:

>
> This patch adds a second logical switch ingress ACL stage, and
> correspondingly a second logical switch egress ACL stage.  This
> allows for more than one ACL-based feature to be applied in the
> ingress and egress logical switch pipelines.  The features
> driving the different ACL stages may be configured by different
> users, for example an application deployer managing security
> groups and a network or security admin configuring network ACLs
> or firewall rules.
>
> Each ACL stage is self contained.  The "action" for the
> highest-"priority" matching row in an ACL stage determines a
> packet's treatment.  A separate "action" will be determined in
> each ACL stage, according to the ACL rules configured for that
> ACL stage.  The "priority" values are only relevant within the
> context of an ACL stage.
>
> ACL rules that do not specify an ACL stage are applied to the
> default "acl" stage.
>
> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>

Could you expand on why priorities in a single stage aren't enough to
satisfy the use case?

<Mickey>
If two features are configured independently with a mix of
prioritized allow and drop rules, then with a single stage, a
new set of ACL rules must be produced that achieves the same
behavior.  This is sometimes referred to as an "ACL merge"
algorithm, for example:
http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514

In the worst case, for example when the features act on different
packet fields (e.g. one on IP address and another on L4 port),
the number of rules required can approach
(# of ACL1 rules) * (# of ACL2 rules).

While it is possible to code up such an algorithm, it adds
significant complexity and complicates whichever layer
implements the merge algorithm, either OVN or the CMS above.

By using multiple independent pipeline stages, all of this
software complexity is avoided, achieving the proper result
in a simple and straightforward manner.

Recent network hardware ASICs tend to have around 8 or 10 ACL
stages, though they tend to evaluate these in parallel given
all the emphasis on low latency these days.

Mickey

Mickey Spiegel July 30, 2016, 8:19 p.m. UTC | #3

On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <emspiege@us.ibm.com>
wrote:
>
> -----"dev" <dev-bounces@openvswitch.org> wrote: -----
>> To: Mickey Spiegel <mickeys.dev@gmail.com>
>> From: Russell Bryant
>> Sent by: "dev"
>> Date: 07/29/2016 10:02AM
>> Cc: ovs dev <dev@openvswitch.org>
>> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
>>
>> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <mickeys.dev@gmail.com>
>> wrote:
>>
>>>
>>> This patch adds a second logical switch ingress ACL stage, and
>>> correspondingly a second logical switch egress ACL stage.  This
>>> allows for more than one ACL-based feature to be applied in the
>>> ingress and egress logical switch pipelines.  The features
>>> driving the different ACL stages may be configured by different
>>> users, for example an application deployer managing security
>>> groups and a network or security admin configuring network ACLs
>>> or firewall rules.
>>>
>>> Each ACL stage is self contained.  The "action" for the
>>> highest-"priority" matching row in an ACL stage determines a
>>> packet's treatment.  A separate "action" will be determined in
>>> each ACL stage, according to the ACL rules configured for that
>>> ACL stage.  The "priority" values are only relevant within the
>>> context of an ACL stage.
>>>
>>> ACL rules that do not specify an ACL stage are applied to the
>>> default "acl" stage.
>>>
>>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
>>
>>
>> Could you expand on why priorities in a single stage aren't enough to
>> satisfy the use case?
>
> If two features are configured independently with a mix of
> prioritized allow and drop rules, then with a single stage, a
> new set of ACL rules must be produced that achieves the same
> behavior.  This is sometimes referred to as an "ACL merge"
> algorithm, for example:
>
http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
>
> In the worst case, for example when the features act on different
> packet fields (e.g. one on IP address and another on L4 port),
> the number of rules required can approach
> (# of ACL1 rules) * (# of ACL2 rules).
>
> While it is possible to code up such an algorithm, it adds
> significant complexity and complicates whichever layer
> implements the merge algorithm, either OVN or the CMS above.
>
> By using multiple independent pipeline stages, all of this
> software complexity is avoided, achieving the proper result
> in a simple and straightforward manner.
>
> Recent network hardware ASICs tend to have around 8 or 10 ACL
> stages, though they tend to evaluate these in parallel given
> all the emphasis on low latency these days.

Throwing in an example to illustrate the difference between one
ACL stage and two ACL stages:

If two separate ACL stages:
Feature 1
acl  from-lport  100 (tcp == 80) allow-related
acl  from-lport  100 (tcp == 8080) allow-related
acl  from-lport  100 (udp) allow-related
acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related

Feature 2
acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related

Combined in one stage, to get the equivalent behavior, this would require:
from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80) allow-related
from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080) allow-related
from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src == 10.1.1.0/24 &&
tcp) allow-related
from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80) allow-related
from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080) allow-related
from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src == 10.1.1.0/24 &&
tcp) allow-related
from-lport  200 (ip4.dst == 172.16.0.0/20) drop
from-lport  200 (ip4.dst == 192.168.0.0/16) drop
from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80) allow-related
from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080) allow-related
from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src == 10.1.1.0/24 && tcp)
allow-related

If there are more IP addresses in feature 2, then the number
of ACL rules will climb geometrically:
(4 feature 1 rules * # feature 2 allow-related rules + # feature 2 drop
rules)

With 2 separate ACL stages, the rules just go straight into
the corresponding ACL table, no merge required:
(# feature 1 rules + # feature 2 rules)

Mickey

> Mickey
>
>>
>> --
>> Russell Bryant
>> _______________________________________________
>> dev mailing list
>> dev@openvswitch.org
>> http://openvswitch.org/mailman/listinfo/dev
>>

Russell Bryant Aug. 2, 2016, 11:52 a.m. UTC | #4

On Sat, Jul 30, 2016 at 4:19 PM, Mickey Spiegel <mickeys.dev@gmail.com>
wrote:

> On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <emspiege@us.ibm.com>
> wrote:
> >
> > -----"dev" <dev-bounces@openvswitch.org> wrote: -----
> >> To: Mickey Spiegel <mickeys.dev@gmail.com>
> >> From: Russell Bryant
> >> Sent by: "dev"
> >> Date: 07/29/2016 10:02AM
> >> Cc: ovs dev <dev@openvswitch.org>
> >> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
> >>
> >> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <mickeys.dev@gmail.com
> >
> >> wrote:
> >>
> >>>
> >>> This patch adds a second logical switch ingress ACL stage, and
> >>> correspondingly a second logical switch egress ACL stage.  This
> >>> allows for more than one ACL-based feature to be applied in the
> >>> ingress and egress logical switch pipelines.  The features
> >>> driving the different ACL stages may be configured by different
> >>> users, for example an application deployer managing security
> >>> groups and a network or security admin configuring network ACLs
> >>> or firewall rules.
> >>>
> >>> Each ACL stage is self contained.  The "action" for the
> >>> highest-"priority" matching row in an ACL stage determines a
> >>> packet's treatment.  A separate "action" will be determined in
> >>> each ACL stage, according to the ACL rules configured for that
> >>> ACL stage.  The "priority" values are only relevant within the
> >>> context of an ACL stage.
> >>>
> >>> ACL rules that do not specify an ACL stage are applied to the
> >>> default "acl" stage.
> >>>
> >>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
> >>
> >>
> >> Could you expand on why priorities in a single stage aren't enough to
> >> satisfy the use case?
> >
> > If two features are configured independently with a mix of
> > prioritized allow and drop rules, then with a single stage, a
> > new set of ACL rules must be produced that achieves the same
> > behavior.  This is sometimes referred to as an "ACL merge"
> > algorithm, for example:
> >
> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
> >
> > In the worst case, for example when the features act on different
> > packet fields (e.g. one on IP address and another on L4 port),
> > the number of rules required can approach
> > (# of ACL1 rules) * (# of ACL2 rules).
> >
> > While it is possible to code up such an algorithm, it adds
> > significant complexity and complicates whichever layer
> > implements the merge algorithm, either OVN or the CMS above.
> >
> > By using multiple independent pipeline stages, all of this
> > software complexity is avoided, achieving the proper result
> > in a simple and straightforward manner.
> >
> > Recent network hardware ASICs tend to have around 8 or 10 ACL
> > stages, though they tend to evaluate these in parallel given
> > all the emphasis on low latency these days.
>
> Throwing in an example to illustrate the difference between one
> ACL stage and two ACL stages:
>
> If two separate ACL stages:
> Feature 1
> acl  from-lport  100 (tcp == 80) allow-related
> acl  from-lport  100 (tcp == 8080) allow-related
> acl  from-lport  100 (udp) allow-related
> acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related
>
> Feature 2
> acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
> acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
> acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
> acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
> acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related
>
> Combined in one stage, to get the equivalent behavior, this would require:
> from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80) allow-related
> from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080) allow-related
> from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
> from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src == 10.1.1.0/24 &&
> tcp) allow-related
> from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80) allow-related
> from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080) allow-related
> from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
> from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src == 10.1.1.0/24 &&
> tcp) allow-related
> from-lport  200 (ip4.dst == 172.16.0.0/20) drop
> from-lport  200 (ip4.dst == 192.168.0.0/16) drop
> from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80) allow-related
> from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080) allow-related
> from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
> from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src == 10.1.1.0/24 &&
> tcp) allow-related
>

Or have an address set, "addrset1", which contains {172.16.10.0/24,
192.168.20.0/24, 172.16.0.0/20, 192.168.0.0/16, 172.16.0.0/16}.

acl  from-lport  100 (ip4.dst == $addrset1 && tcp && tcp.dst == {80, 8080})
allow-related
acl  from-lport  100 (ip4.dst == $addrset1 && udp) allow-related
acl  from-lport  100 (ip4.dst == $addrset1 && ip4.src == 10.1.1.0/24 &&
tcp) allow-related


>
> If there are more IP addresses in feature 2, then the number
> of ACL rules will climb geometrically:
> (4 feature 1 rules * # feature 2 allow-related rules + # feature 2 drop
> rules)
>
> With 2 separate ACL stages, the rules just go straight into
> the corresponding ACL table, no merge required:
> (# feature 1 rules + # feature 2 rules)
>

Thanks for elaborating.  I'm not opposed.  It seems harmless if not being
used.

Can you update the docs to indicate the specific accepted values for
"stage"?  It currently sounds like you can use as many stages as you want
to me.

Darrell Ball Aug. 2, 2016, 4:26 p.m. UTC | #5

On Tue, Aug 2, 2016 at 4:52 AM, Russell Bryant <russell@ovn.org> wrote:

> On Sat, Jul 30, 2016 at 4:19 PM, Mickey Spiegel <mickeys.dev@gmail.com>
> wrote:
>
> > On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <emspiege@us.ibm.com>
> > wrote:
> > >
> > > -----"dev" <dev-bounces@openvswitch.org> wrote: -----
> > >> To: Mickey Spiegel <mickeys.dev@gmail.com>
> > >> From: Russell Bryant
> > >> Sent by: "dev"
> > >> Date: 07/29/2016 10:02AM
> > >> Cc: ovs dev <dev@openvswitch.org>
> > >> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
> > >>
> > >> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <
> mickeys.dev@gmail.com
> > >
> > >> wrote:
> > >>
> > >>>
> > >>> This patch adds a second logical switch ingress ACL stage, and
> > >>> correspondingly a second logical switch egress ACL stage.  This
> > >>> allows for more than one ACL-based feature to be applied in the
> > >>> ingress and egress logical switch pipelines.  The features
> > >>> driving the different ACL stages may be configured by different
> > >>> users, for example an application deployer managing security
> > >>> groups and a network or security admin configuring network ACLs
> > >>> or firewall rules.
> > >>>
> > >>> Each ACL stage is self contained.  The "action" for the
> > >>> highest-"priority" matching row in an ACL stage determines a
> > >>> packet's treatment.  A separate "action" will be determined in
> > >>> each ACL stage, according to the ACL rules configured for that
> > >>> ACL stage.  The "priority" values are only relevant within the
> > >>> context of an ACL stage.
> > >>>
> > >>> ACL rules that do not specify an ACL stage are applied to the
> > >>> default "acl" stage.
> > >>>
> > >>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
> > >>
> > >>
> > >> Could you expand on why priorities in a single stage aren't enough to
> > >> satisfy the use case?
> > >
> > > If two features are configured independently with a mix of
> > > prioritized allow and drop rules, then with a single stage, a
> > > new set of ACL rules must be produced that achieves the same
> > > behavior.  This is sometimes referred to as an "ACL merge"
> > > algorithm, for example:
> > >
> >
> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
> > >
> > > In the worst case, for example when the features act on different
> > > packet fields (e.g. one on IP address and another on L4 port),
> > > the number of rules required can approach
> > > (# of ACL1 rules) * (# of ACL2 rules).
> > >
> > > While it is possible to code up such an algorithm, it adds
> > > significant complexity and complicates whichever layer
> > > implements the merge algorithm, either OVN or the CMS above.
> > >
> > > By using multiple independent pipeline stages, all of this
> > > software complexity is avoided, achieving the proper result
> > > in a simple and straightforward manner.
> > >
> > > Recent network hardware ASICs tend to have around 8 or 10 ACL
> > > stages, though they tend to evaluate these in parallel given
> > > all the emphasis on low latency these days.
> >
> > Throwing in an example to illustrate the difference between one
> > ACL stage and two ACL stages:
> >
> > If two separate ACL stages:
> > Feature 1
> > acl  from-lport  100 (tcp == 80) allow-related
> > acl  from-lport  100 (tcp == 8080) allow-related
> > acl  from-lport  100 (udp) allow-related
> > acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related
> >
> > Feature 2
> > acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
> > acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
> > acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
> > acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
> > acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related
> >
> > Combined in one stage, to get the equivalent behavior, this would
> require:
> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80) allow-related
> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080) allow-related
> > from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
> > from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src == 10.1.1.0/24 &&
> > tcp) allow-related
> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80) allow-related
> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080)
> allow-related
> > from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
> > from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src == 10.1.1.0/24 &&
> > tcp) allow-related
> > from-lport  200 (ip4.dst == 172.16.0.0/20) drop
> > from-lport  200 (ip4.dst == 192.168.0.0/16) drop
> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80) allow-related
> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080) allow-related
> > from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
> > from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src == 10.1.1.0/24 &&
> > tcp) allow-related
> >
>
> Or have an address set, "addrset1", which contains {172.16.10.0/24,
> 192.168.20.0/24, 172.16.0.0/20, 192.168.0.0/16, 172.16.0.0/16}.
>
> acl  from-lport  100 (ip4.dst == $addrset1 && tcp && tcp.dst == {80, 8080})
> allow-related
> acl  from-lport  100 (ip4.dst == $addrset1 && udp) allow-related
> acl  from-lport  100 (ip4.dst == $addrset1 && ip4.src == 10.1.1.0/24 &&
> tcp) allow-related
>
>
> >
> > If there are more IP addresses in feature 2, then the number
> > of ACL rules will climb geometrically:
> > (4 feature 1 rules * # feature 2 allow-related rules + # feature 2 drop
> > rules)
> >
> > With 2 separate ACL stages, the rules just go straight into
> > the corresponding ACL table, no merge required:
> > (# feature 1 rules + # feature 2 rules)
> >
>
> Thanks for elaborating.  I'm not opposed.  It seems harmless if not being
> used.
>


There are presently no unit tests for ACLs in the system tests
(system-ovn.at).
The first step should be to add unit tests for single stage ACLs.
and then add a delta of tests if other stages are desired.

It will be good to test the coordination between multiple stages
coming directly from northbound APIs and check what happens when
multistage ACLs are setup and torn down stage by stage, particularly
when the datapath ends up in a more permissive state for some period of
time.



>
> Can you update the docs to indicate the specific accepted values for
> "stage"?



This would significantly complicate the usage of northbound ACL APIs,
since multi-staging would be exposed at the top (northbound) OVN layer.

This would need a clear set of guidelines how northbound
multistage ACLs would be used by a CMS, at the user level.



> It currently sounds like you can use as many stages as you want
> to me.
>
> --
> Russell Bryant
> _______________________________________________
> dev mailing list
> dev@openvswitch.org
> http://openvswitch.org/mailman/listinfo/dev
>

Mickey Spiegel Aug. 2, 2016, 5:23 p.m. UTC | #6

On Tue, Aug 2, 2016 at 9:26 AM, Darrell Ball <dlu998@gmail.com> wrote:

>
>
> On Tue, Aug 2, 2016 at 4:52 AM, Russell Bryant <russell@ovn.org> wrote:
>
>> On Sat, Jul 30, 2016 at 4:19 PM, Mickey Spiegel <mickeys.dev@gmail.com>
>> wrote:
>>
>> > On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <emspiege@us.ibm.com>
>> > wrote:
>> > >
>> > > -----"dev" <dev-bounces@openvswitch.org> wrote: -----
>> > >> To: Mickey Spiegel <mickeys.dev@gmail.com>
>> > >> From: Russell Bryant
>> > >> Sent by: "dev"
>> > >> Date: 07/29/2016 10:02AM
>> > >> Cc: ovs dev <dev@openvswitch.org>
>> > >> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
>> > >>
>> > >> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <
>> mickeys.dev@gmail.com
>> > >
>> > >> wrote:
>> > >>
>> > >>>
>> > >>> This patch adds a second logical switch ingress ACL stage, and
>> > >>> correspondingly a second logical switch egress ACL stage.  This
>> > >>> allows for more than one ACL-based feature to be applied in the
>> > >>> ingress and egress logical switch pipelines.  The features
>> > >>> driving the different ACL stages may be configured by different
>> > >>> users, for example an application deployer managing security
>> > >>> groups and a network or security admin configuring network ACLs
>> > >>> or firewall rules.
>> > >>>
>> > >>> Each ACL stage is self contained.  The "action" for the
>> > >>> highest-"priority" matching row in an ACL stage determines a
>> > >>> packet's treatment.  A separate "action" will be determined in
>> > >>> each ACL stage, according to the ACL rules configured for that
>> > >>> ACL stage.  The "priority" values are only relevant within the
>> > >>> context of an ACL stage.
>> > >>>
>> > >>> ACL rules that do not specify an ACL stage are applied to the
>> > >>> default "acl" stage.
>> > >>>
>> > >>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
>> > >>
>> > >>
>> > >> Could you expand on why priorities in a single stage aren't enough to
>> > >> satisfy the use case?
>> > >
>> > > If two features are configured independently with a mix of
>> > > prioritized allow and drop rules, then with a single stage, a
>> > > new set of ACL rules must be produced that achieves the same
>> > > behavior.  This is sometimes referred to as an "ACL merge"
>> > > algorithm, for example:
>> > >
>> >
>> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
>> > >
>> > > In the worst case, for example when the features act on different
>> > > packet fields (e.g. one on IP address and another on L4 port),
>> > > the number of rules required can approach
>> > > (# of ACL1 rules) * (# of ACL2 rules).
>> > >
>> > > While it is possible to code up such an algorithm, it adds
>> > > significant complexity and complicates whichever layer
>> > > implements the merge algorithm, either OVN or the CMS above.
>> > >
>> > > By using multiple independent pipeline stages, all of this
>> > > software complexity is avoided, achieving the proper result
>> > > in a simple and straightforward manner.
>> > >
>> > > Recent network hardware ASICs tend to have around 8 or 10 ACL
>> > > stages, though they tend to evaluate these in parallel given
>> > > all the emphasis on low latency these days.
>> >
>> > Throwing in an example to illustrate the difference between one
>> > ACL stage and two ACL stages:
>> >
>> > If two separate ACL stages:
>> > Feature 1
>> > acl  from-lport  100 (tcp == 80) allow-related
>> > acl  from-lport  100 (tcp == 8080) allow-related
>> > acl  from-lport  100 (udp) allow-related
>> > acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related
>> >
>> > Feature 2
>> > acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
>> > acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
>> > acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>> > acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>> > acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related
>> >
>> > Combined in one stage, to get the equivalent behavior, this would
>> require:
>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80) allow-related
>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080)
>> allow-related
>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src == 10.1.1.0/24 &&
>> > tcp) allow-related
>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80) allow-related
>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080)
>> allow-related
>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src == 10.1.1.0/24
>> &&
>> > tcp) allow-related
>> > from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>> > from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80) allow-related
>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080) allow-related
>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src == 10.1.1.0/24 &&
>> > tcp) allow-related
>> >
>>
>> Or have an address set, "addrset1", which contains {172.16.10.0/24,
>> 192.168.20.0/24, 172.16.0.0/20, 192.168.0.0/16, 172.16.0.0/16}.
>>
>> acl  from-lport  100 (ip4.dst == $addrset1 && tcp && tcp.dst == {80,
>> 8080})
>> allow-related
>> acl  from-lport  100 (ip4.dst == $addrset1 && udp) allow-related
>> acl  from-lport  100 (ip4.dst == $addrset1 && ip4.src == 10.1.1.0/24 &&
>> tcp) allow-related
>>
>>
>> >
>> > If there are more IP addresses in feature 2, then the number
>> > of ACL rules will climb geometrically:
>> > (4 feature 1 rules * # feature 2 allow-related rules + # feature 2 drop
>> > rules)
>> >
>> > With 2 separate ACL stages, the rules just go straight into
>> > the corresponding ACL table, no merge required:
>> > (# feature 1 rules + # feature 2 rules)
>> >
>>
>> Thanks for elaborating.  I'm not opposed.  It seems harmless if not being
>> used.
>>
>
>
> There are presently no unit tests for ACLs in the system tests
> (system-ovn.at).
> The first step should be to add unit tests for single stage ACLs.
> and then add a delta of tests if other stages are desired.
>
> It will be good to test the coordination between multiple stages
> coming directly from northbound APIs and check what happens when
> multistage ACLs are setup and torn down stage by stage, particularly
> when the datapath ends up in a more permissive state for some period of
> time.
>
>
>
>>
>> Can you update the docs to indicate the specific accepted values for
>> "stage"?
>
>
>
> This would significantly complicate the usage of northbound ACL APIs,
> since multi-staging would be exposed at the top (northbound) OVN layer.
>

The default behavior when "stage" is not specified is to apply the ACL to
the
existing "acl" stage. If you don't care about the second ACL stage, continue
to use ACLs as you do today and it will work. There is no complication.


> This would need a clear set of guidelines how northbound
> multistage ACLs would be used by a CMS, at the user level.
>

The CMS typically does not expose ACLs directly to the user. For example,
with OpenStack, Security Groups use the default "acl" stage. OpenStack
FWaaS v2 would use the "acl2" stage. These are two separate OpenStack
features with separate OpenStack northbound APIs to the user.

Mickey


>
>> It currently sounds like you can use as many stages as you want
>> to me.
>>
>> --
>> Russell Bryant
>> _______________________________________________
>> dev mailing list
>> dev@openvswitch.org
>> http://openvswitch.org/mailman/listinfo/dev
>>
>
>

Gurucharan Shetty Aug. 2, 2016, 5:29 p.m. UTC | #7

On 2 August 2016 at 10:23, Mickey Spiegel <mickeys.dev@gmail.com> wrote:

> On Tue, Aug 2, 2016 at 9:26 AM, Darrell Ball <dlu998@gmail.com> wrote:
>
> >
> >
> > On Tue, Aug 2, 2016 at 4:52 AM, Russell Bryant <russell@ovn.org> wrote:
> >
> >> On Sat, Jul 30, 2016 at 4:19 PM, Mickey Spiegel <mickeys.dev@gmail.com>
> >> wrote:
> >>
> >> > On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <emspiege@us.ibm.com
> >
> >> > wrote:
> >> > >
> >> > > -----"dev" <dev-bounces@openvswitch.org> wrote: -----
> >> > >> To: Mickey Spiegel <mickeys.dev@gmail.com>
> >> > >> From: Russell Bryant
> >> > >> Sent by: "dev"
> >> > >> Date: 07/29/2016 10:02AM
> >> > >> Cc: ovs dev <dev@openvswitch.org>
> >> > >> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
> >> > >>
> >> > >> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <
> >> mickeys.dev@gmail.com
> >> > >
> >> > >> wrote:
> >> > >>
> >> > >>>
> >> > >>> This patch adds a second logical switch ingress ACL stage, and
> >> > >>> correspondingly a second logical switch egress ACL stage.  This
> >> > >>> allows for more than one ACL-based feature to be applied in the
> >> > >>> ingress and egress logical switch pipelines.  The features
> >> > >>> driving the different ACL stages may be configured by different
> >> > >>> users, for example an application deployer managing security
> >> > >>> groups and a network or security admin configuring network ACLs
> >> > >>> or firewall rules.
> >> > >>>
> >> > >>> Each ACL stage is self contained.  The "action" for the
> >> > >>> highest-"priority" matching row in an ACL stage determines a
> >> > >>> packet's treatment.  A separate "action" will be determined in
> >> > >>> each ACL stage, according to the ACL rules configured for that
> >> > >>> ACL stage.  The "priority" values are only relevant within the
> >> > >>> context of an ACL stage.
> >> > >>>
> >> > >>> ACL rules that do not specify an ACL stage are applied to the
> >> > >>> default "acl" stage.
> >> > >>>
> >> > >>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
> >> > >>
> >> > >>
> >> > >> Could you expand on why priorities in a single stage aren't enough
> to
> >> > >> satisfy the use case?
> >> > >
> >> > > If two features are configured independently with a mix of
> >> > > prioritized allow and drop rules, then with a single stage, a
> >> > > new set of ACL rules must be produced that achieves the same
> >> > > behavior.  This is sometimes referred to as an "ACL merge"
> >> > > algorithm, for example:
> >> > >
> >> >
> >>
> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
> >> > >
> >> > > In the worst case, for example when the features act on different
> >> > > packet fields (e.g. one on IP address and another on L4 port),
> >> > > the number of rules required can approach
> >> > > (# of ACL1 rules) * (# of ACL2 rules).
> >> > >
> >> > > While it is possible to code up such an algorithm, it adds
> >> > > significant complexity and complicates whichever layer
> >> > > implements the merge algorithm, either OVN or the CMS above.
> >> > >
> >> > > By using multiple independent pipeline stages, all of this
> >> > > software complexity is avoided, achieving the proper result
> >> > > in a simple and straightforward manner.
> >> > >
> >> > > Recent network hardware ASICs tend to have around 8 or 10 ACL
> >> > > stages, though they tend to evaluate these in parallel given
> >> > > all the emphasis on low latency these days.
> >> >
> >> > Throwing in an example to illustrate the difference between one
> >> > ACL stage and two ACL stages:
> >> >
> >> > If two separate ACL stages:
> >> > Feature 1
> >> > acl  from-lport  100 (tcp == 80) allow-related
> >> > acl  from-lport  100 (tcp == 8080) allow-related
> >> > acl  from-lport  100 (udp) allow-related
> >> > acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related
> >> >
> >> > Feature 2
> >> > acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
> >> > acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
> >> > acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
> >> > acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
> >> > acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related
> >> >
> >> > Combined in one stage, to get the equivalent behavior, this would
> >> require:
> >> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80)
> allow-related
> >> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080)
> >> allow-related
> >> > from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
> >> > from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src == 10.1.1.0/24
> &&
> >> > tcp) allow-related
> >> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80)
> allow-related
> >> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080)
> >> allow-related
> >> > from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
> >> > from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src == 10.1.1.0/24
> >> &&
> >> > tcp) allow-related
> >> > from-lport  200 (ip4.dst == 172.16.0.0/20) drop
> >> > from-lport  200 (ip4.dst == 192.168.0.0/16) drop
> >> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80) allow-related
> >> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080)
> allow-related
> >> > from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
> >> > from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src == 10.1.1.0/24
> &&
> >> > tcp) allow-related
> >> >
> >>
> >> Or have an address set, "addrset1", which contains {172.16.10.0/24,
> >> 192.168.20.0/24, 172.16.0.0/20, 192.168.0.0/16, 172.16.0.0/16}.
> >>
> >> acl  from-lport  100 (ip4.dst == $addrset1 && tcp && tcp.dst == {80,
> >> 8080})
> >> allow-related
> >> acl  from-lport  100 (ip4.dst == $addrset1 && udp) allow-related
> >> acl  from-lport  100 (ip4.dst == $addrset1 && ip4.src == 10.1.1.0/24 &&
> >> tcp) allow-related
> >>
> >>
> >> >
> >> > If there are more IP addresses in feature 2, then the number
> >> > of ACL rules will climb geometrically:
> >> > (4 feature 1 rules * # feature 2 allow-related rules + # feature 2
> drop
> >> > rules)
> >> >
> >> > With 2 separate ACL stages, the rules just go straight into
> >> > the corresponding ACL table, no merge required:
> >> > (# feature 1 rules + # feature 2 rules)
> >> >
> >>
> >> Thanks for elaborating.  I'm not opposed.  It seems harmless if not
> being
> >> used.
> >>
> >
> >
> > There are presently no unit tests for ACLs in the system tests
> > (system-ovn.at).
> > The first step should be to add unit tests for single stage ACLs.
> > and then add a delta of tests if other stages are desired.
> >
> > It will be good to test the coordination between multiple stages
> > coming directly from northbound APIs and check what happens when
> > multistage ACLs are setup and torn down stage by stage, particularly
> > when the datapath ends up in a more permissive state for some period of
> > time.
> >
> >
> >
> >>
> >> Can you update the docs to indicate the specific accepted values for
> >> "stage"?
> >
> >
> >
> > This would significantly complicate the usage of northbound ACL APIs,
> > since multi-staging would be exposed at the top (northbound) OVN layer.
> >
>
> The default behavior when "stage" is not specified is to apply the ACL to
> the
> existing "acl" stage. If you don't care about the second ACL stage,
> continue
> to use ACLs as you do today and it will work. There is no complication.
>

The 2 ct_commit for deletion of firewall rules will likely be tricky. This
will need unit tests.


>
>
> > This would need a clear set of guidelines how northbound
> > multistage ACLs would be used by a CMS, at the user level.
> >
>
> The CMS typically does not expose ACLs directly to the user. For example,
> with OpenStack, Security Groups use the default "acl" stage. OpenStack
> FWaaS v2 would use the "acl2" stage. These are two separate OpenStack
> features with separate OpenStack northbound APIs to the user.
>
> Mickey
>
>
> >
> >> It currently sounds like you can use as many stages as you want
> >> to me.
> >>
> >> --
> >> Russell Bryant
> >> _______________________________________________
> >> dev mailing list
> >> dev@openvswitch.org
> >> http://openvswitch.org/mailman/listinfo/dev
> >>
> >
> >
> _______________________________________________
> dev mailing list
> dev@openvswitch.org
> http://openvswitch.org/mailman/listinfo/dev
>

Russell Bryant Aug. 2, 2016, 7:01 p.m. UTC | #8

On Tue, Aug 2, 2016 at 1:29 PM, Guru Shetty <guru@ovn.org> wrote:

> The 2 ct_commit for deletion of firewall rules will likely be tricky. This
> will need unit tests.
>

I don't think I understand the concern.  Can you expand a bit on what you
mean by "2 ct_commit for deletion of firewall rules"?

Darrell Ball Aug. 2, 2016, 7:02 p.m. UTC | #9

On Tue, Aug 2, 2016 at 10:23 AM, Mickey Spiegel <mickeys.dev@gmail.com>
wrote:

> On Tue, Aug 2, 2016 at 9:26 AM, Darrell Ball <dlu998@gmail.com> wrote:
>
>>
>>
>> On Tue, Aug 2, 2016 at 4:52 AM, Russell Bryant <russell@ovn.org> wrote:
>>
>>> On Sat, Jul 30, 2016 at 4:19 PM, Mickey Spiegel <mickeys.dev@gmail.com>
>>> wrote:
>>>
>>> > On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <emspiege@us.ibm.com>
>>> > wrote:
>>> > >
>>> > > -----"dev" <dev-bounces@openvswitch.org> wrote: -----
>>> > >> To: Mickey Spiegel <mickeys.dev@gmail.com>
>>> > >> From: Russell Bryant
>>> > >> Sent by: "dev"
>>> > >> Date: 07/29/2016 10:02AM
>>> > >> Cc: ovs dev <dev@openvswitch.org>
>>> > >> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
>>> > >>
>>> > >> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <
>>> mickeys.dev@gmail.com
>>> > >
>>> > >> wrote:
>>> > >>
>>> > >>>
>>> > >>> This patch adds a second logical switch ingress ACL stage, and
>>> > >>> correspondingly a second logical switch egress ACL stage.  This
>>> > >>> allows for more than one ACL-based feature to be applied in the
>>> > >>> ingress and egress logical switch pipelines.  The features
>>> > >>> driving the different ACL stages may be configured by different
>>> > >>> users, for example an application deployer managing security
>>> > >>> groups and a network or security admin configuring network ACLs
>>> > >>> or firewall rules.
>>> > >>>
>>> > >>> Each ACL stage is self contained.  The "action" for the
>>> > >>> highest-"priority" matching row in an ACL stage determines a
>>> > >>> packet's treatment.  A separate "action" will be determined in
>>> > >>> each ACL stage, according to the ACL rules configured for that
>>> > >>> ACL stage.  The "priority" values are only relevant within the
>>> > >>> context of an ACL stage.
>>> > >>>
>>> > >>> ACL rules that do not specify an ACL stage are applied to the
>>> > >>> default "acl" stage.
>>> > >>>
>>> > >>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
>>> > >>
>>> > >>
>>> > >> Could you expand on why priorities in a single stage aren't enough
>>> to
>>> > >> satisfy the use case?
>>> > >
>>> > > If two features are configured independently with a mix of
>>> > > prioritized allow and drop rules, then with a single stage, a
>>> > > new set of ACL rules must be produced that achieves the same
>>> > > behavior.  This is sometimes referred to as an "ACL merge"
>>> > > algorithm, for example:
>>> > >
>>> >
>>> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
>>> > >
>>> > > In the worst case, for example when the features act on different
>>> > > packet fields (e.g. one on IP address and another on L4 port),
>>> > > the number of rules required can approach
>>> > > (# of ACL1 rules) * (# of ACL2 rules).
>>> > >
>>> > > While it is possible to code up such an algorithm, it adds
>>> > > significant complexity and complicates whichever layer
>>> > > implements the merge algorithm, either OVN or the CMS above.
>>> > >
>>> > > By using multiple independent pipeline stages, all of this
>>> > > software complexity is avoided, achieving the proper result
>>> > > in a simple and straightforward manner.
>>> > >
>>> > > Recent network hardware ASICs tend to have around 8 or 10 ACL
>>> > > stages, though they tend to evaluate these in parallel given
>>> > > all the emphasis on low latency these days.
>>> >
>>> > Throwing in an example to illustrate the difference between one
>>> > ACL stage and two ACL stages:
>>> >
>>> > If two separate ACL stages:
>>> > Feature 1
>>> > acl  from-lport  100 (tcp == 80) allow-related
>>> > acl  from-lport  100 (tcp == 8080) allow-related
>>> > acl  from-lport  100 (udp) allow-related
>>> > acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related
>>> >
>>> > Feature 2
>>> > acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
>>> > acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
>>> > acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>> > acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>> > acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related
>>> >
>>> > Combined in one stage, to get the equivalent behavior, this would
>>> require:
>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80) allow-related
>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080)
>>> allow-related
>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src == 10.1.1.0/24
>>> &&
>>> > tcp) allow-related
>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80)
>>> allow-related
>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080)
>>> allow-related
>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src == 10.1.1.0/24
>>> &&
>>> > tcp) allow-related
>>> > from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>> > from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80) allow-related
>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080)
>>> allow-related
>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src == 10.1.1.0/24 &&
>>> > tcp) allow-related
>>> >
>>>
>>> Or have an address set, "addrset1", which contains {172.16.10.0/24,
>>> 192.168.20.0/24, 172.16.0.0/20, 192.168.0.0/16, 172.16.0.0/16}.
>>>
>>> acl  from-lport  100 (ip4.dst == $addrset1 && tcp && tcp.dst == {80,
>>> 8080})
>>> allow-related
>>> acl  from-lport  100 (ip4.dst == $addrset1 && udp) allow-related
>>> acl  from-lport  100 (ip4.dst == $addrset1 && ip4.src == 10.1.1.0/24 &&
>>> tcp) allow-related
>>>
>>>
>>> >
>>> > If there are more IP addresses in feature 2, then the number
>>> > of ACL rules will climb geometrically:
>>> > (4 feature 1 rules * # feature 2 allow-related rules + # feature 2 drop
>>> > rules)
>>> >
>>> > With 2 separate ACL stages, the rules just go straight into
>>> > the corresponding ACL table, no merge required:
>>> > (# feature 1 rules + # feature 2 rules)
>>> >
>>>
>>> Thanks for elaborating.  I'm not opposed.  It seems harmless if not being
>>> used.
>>>
>>
>>
>> There are presently no unit tests for ACLs in the system tests
>> (system-ovn.at).
>> The first step should be to add unit tests for single stage ACLs.
>> and then add a delta of tests if other stages are desired.
>>
>> It will be good to test the coordination between multiple stages
>> coming directly from northbound APIs and check what happens when
>> multistage ACLs are setup and torn down stage by stage, particularly
>> when the datapath ends up in a more permissive state for some period of
>> time.
>>
>
This feature proposal has a problem for both setup and teardown where
the staging will result in a more permissive state for periods of time.

Here is a simple example based on your example above:
If one only wants to allow TCP and src IP 20.20.20.20 and the stage with
TCP is
added first with the stage with src IP 20.20.20.20 lagging, one will have
the
following

200 TCP permit
100 DROP ALL

which permits all TCP - not what we want.

We cannot enforce a transaction across multiple databases (NB, SB,
ovn-controller)



>
>>
>>
>>>
>>> Can you update the docs to indicate the specific accepted values for
>>> "stage"?
>>
>>
>>
>> This would significantly complicate the usage of northbound ACL APIs,
>> since multi-staging would be exposed at the top (northbound) OVN layer.
>>
>
> The default behavior when "stage" is not specified is to apply the ACL to
> the
> existing "acl" stage. If you don't care about the second ACL stage,
> continue
> to use ACLs as you do today and it will work. There is no complication.
>

You need a set of guidelines.
You just cannot assume the northbound API usage will avoid this feature.
How does one know this feature should be avoided or when to use it.
Assuming one decides to use it, how does one know how to use it.



>
>
>> This would need a clear set of guidelines how northbound
>> multistage ACLs would be used by a CMS, at the user level.
>>
>
> The CMS typically does not expose ACLs directly to the user. For example,
> with OpenStack, Security Groups use the default "acl" stage. OpenStack
> FWaaS v2 would use the "acl2" stage. These are two separate OpenStack
> features with separate OpenStack northbound APIs to the user.
>


First of all, every OVN feature should not be tied to Openstack.



>
> Mickey
>
>
>>
>>> It currently sounds like you can use as many stages as you want
>>> to me.
>>>
>>> --
>>> Russell Bryant
>>> _______________________________________________
>>> dev mailing list
>>> dev@openvswitch.org
>>> http://openvswitch.org/mailman/listinfo/dev
>>>
>>
>>
>

Russell Bryant Aug. 2, 2016, 7:05 p.m. UTC | #10

On Tue, Aug 2, 2016 at 3:02 PM, Darrell Ball <dlu998@gmail.com> wrote:

>
>
> On Tue, Aug 2, 2016 at 10:23 AM, Mickey Spiegel <mickeys.dev@gmail.com>
> wrote:
>
>> On Tue, Aug 2, 2016 at 9:26 AM, Darrell Ball <dlu998@gmail.com> wrote:
>>
>>>
>>>
>>> On Tue, Aug 2, 2016 at 4:52 AM, Russell Bryant <russell@ovn.org> wrote:
>>>
>>>> On Sat, Jul 30, 2016 at 4:19 PM, Mickey Spiegel <mickeys.dev@gmail.com>
>>>> wrote:
>>>>
>>>> > On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <emspiege@us.ibm.com
>>>> >
>>>> > wrote:
>>>> > >
>>>> > > -----"dev" <dev-bounces@openvswitch.org> wrote: -----
>>>> > >> To: Mickey Spiegel <mickeys.dev@gmail.com>
>>>> > >> From: Russell Bryant
>>>> > >> Sent by: "dev"
>>>> > >> Date: 07/29/2016 10:02AM
>>>> > >> Cc: ovs dev <dev@openvswitch.org>
>>>> > >> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
>>>> > >>
>>>> > >> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <
>>>> mickeys.dev@gmail.com
>>>> > >
>>>> > >> wrote:
>>>> > >>
>>>> > >>>
>>>> > >>> This patch adds a second logical switch ingress ACL stage, and
>>>> > >>> correspondingly a second logical switch egress ACL stage.  This
>>>> > >>> allows for more than one ACL-based feature to be applied in the
>>>> > >>> ingress and egress logical switch pipelines.  The features
>>>> > >>> driving the different ACL stages may be configured by different
>>>> > >>> users, for example an application deployer managing security
>>>> > >>> groups and a network or security admin configuring network ACLs
>>>> > >>> or firewall rules.
>>>> > >>>
>>>> > >>> Each ACL stage is self contained.  The "action" for the
>>>> > >>> highest-"priority" matching row in an ACL stage determines a
>>>> > >>> packet's treatment.  A separate "action" will be determined in
>>>> > >>> each ACL stage, according to the ACL rules configured for that
>>>> > >>> ACL stage.  The "priority" values are only relevant within the
>>>> > >>> context of an ACL stage.
>>>> > >>>
>>>> > >>> ACL rules that do not specify an ACL stage are applied to the
>>>> > >>> default "acl" stage.
>>>> > >>>
>>>> > >>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
>>>> > >>
>>>> > >>
>>>> > >> Could you expand on why priorities in a single stage aren't enough
>>>> to
>>>> > >> satisfy the use case?
>>>> > >
>>>> > > If two features are configured independently with a mix of
>>>> > > prioritized allow and drop rules, then with a single stage, a
>>>> > > new set of ACL rules must be produced that achieves the same
>>>> > > behavior.  This is sometimes referred to as an "ACL merge"
>>>> > > algorithm, for example:
>>>> > >
>>>> >
>>>> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
>>>> > >
>>>> > > In the worst case, for example when the features act on different
>>>> > > packet fields (e.g. one on IP address and another on L4 port),
>>>> > > the number of rules required can approach
>>>> > > (# of ACL1 rules) * (# of ACL2 rules).
>>>> > >
>>>> > > While it is possible to code up such an algorithm, it adds
>>>> > > significant complexity and complicates whichever layer
>>>> > > implements the merge algorithm, either OVN or the CMS above.
>>>> > >
>>>> > > By using multiple independent pipeline stages, all of this
>>>> > > software complexity is avoided, achieving the proper result
>>>> > > in a simple and straightforward manner.
>>>> > >
>>>> > > Recent network hardware ASICs tend to have around 8 or 10 ACL
>>>> > > stages, though they tend to evaluate these in parallel given
>>>> > > all the emphasis on low latency these days.
>>>> >
>>>> > Throwing in an example to illustrate the difference between one
>>>> > ACL stage and two ACL stages:
>>>> >
>>>> > If two separate ACL stages:
>>>> > Feature 1
>>>> > acl  from-lport  100 (tcp == 80) allow-related
>>>> > acl  from-lport  100 (tcp == 8080) allow-related
>>>> > acl  from-lport  100 (udp) allow-related
>>>> > acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related
>>>> >
>>>> > Feature 2
>>>> > acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
>>>> > acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
>>>> > acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>>> > acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>>> > acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related
>>>> >
>>>> > Combined in one stage, to get the equivalent behavior, this would
>>>> require:
>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80)
>>>> allow-related
>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080)
>>>> allow-related
>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src == 10.1.1.0/24
>>>> &&
>>>> > tcp) allow-related
>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80)
>>>> allow-related
>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080)
>>>> allow-related
>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src == 10.1.1.0/24
>>>> &&
>>>> > tcp) allow-related
>>>> > from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>>> > from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80) allow-related
>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080)
>>>> allow-related
>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src == 10.1.1.0/24
>>>> &&
>>>> > tcp) allow-related
>>>> >
>>>>
>>>> Or have an address set, "addrset1", which contains {172.16.10.0/24,
>>>> 192.168.20.0/24, 172.16.0.0/20, 192.168.0.0/16, 172.16.0.0/16}.
>>>>
>>>> acl  from-lport  100 (ip4.dst == $addrset1 && tcp && tcp.dst == {80,
>>>> 8080})
>>>> allow-related
>>>> acl  from-lport  100 (ip4.dst == $addrset1 && udp) allow-related
>>>> acl  from-lport  100 (ip4.dst == $addrset1 && ip4.src == 10.1.1.0/24 &&
>>>> tcp) allow-related
>>>>
>>>>
>>>> >
>>>> > If there are more IP addresses in feature 2, then the number
>>>> > of ACL rules will climb geometrically:
>>>> > (4 feature 1 rules * # feature 2 allow-related rules + # feature 2
>>>> drop
>>>> > rules)
>>>> >
>>>> > With 2 separate ACL stages, the rules just go straight into
>>>> > the corresponding ACL table, no merge required:
>>>> > (# feature 1 rules + # feature 2 rules)
>>>> >
>>>>
>>>> Thanks for elaborating.  I'm not opposed.  It seems harmless if not
>>>> being
>>>> used.
>>>>
>>>
>>>
>>> There are presently no unit tests for ACLs in the system tests
>>> (system-ovn.at).
>>> The first step should be to add unit tests for single stage ACLs.
>>> and then add a delta of tests if other stages are desired.
>>>
>>> It will be good to test the coordination between multiple stages
>>> coming directly from northbound APIs and check what happens when
>>> multistage ACLs are setup and torn down stage by stage, particularly
>>> when the datapath ends up in a more permissive state for some period of
>>> time.
>>>
>>
> This feature proposal has a problem for both setup and teardown where
> the staging will result in a more permissive state for periods of time.
>
> Here is a simple example based on your example above:
> If one only wants to allow TCP and src IP 20.20.20.20 and the stage with
> TCP is
> added first with the stage with src IP 20.20.20.20 lagging, one will have
> the
> following
>
> 200 TCP permit
> 100 DROP ALL
>
> which permits all TCP - not what we want.
>
> We cannot enforce a transaction across multiple databases (NB, SB,
> ovn-controller)
>

I don't understand this.  Rules for both stages could be added in the same
transaction.  It's all in the same table of the northbound database.


>
>
>
>>
>>>
>>>
>>>>
>>>> Can you update the docs to indicate the specific accepted values for
>>>> "stage"?
>>>
>>>
>>>
>>> This would significantly complicate the usage of northbound ACL APIs,
>>> since multi-staging would be exposed at the top (northbound) OVN layer.
>>>
>>
>> The default behavior when "stage" is not specified is to apply the ACL to
>> the
>> existing "acl" stage. If you don't care about the second ACL stage,
>> continue
>> to use ACLs as you do today and it will work. There is no complication.
>>
>
> You need a set of guidelines.
> You just cannot assume the northbound API usage will avoid this feature.
> How does one know this feature should be avoided or when to use it.
> Assuming one decides to use it, how does one know how to use it.
>
>
>
>>
>>
>>> This would need a clear set of guidelines how northbound
>>> multistage ACLs would be used by a CMS, at the user level.
>>>
>>
>> The CMS typically does not expose ACLs directly to the user. For example,
>> with OpenStack, Security Groups use the default "acl" stage. OpenStack
>> FWaaS v2 would use the "acl2" stage. These are two separate OpenStack
>> features with separate OpenStack northbound APIs to the user.
>>
>
>
> First of all, every OVN feature should not be tied to Openstack.]
>

It was just used as an example of how it would be used ...

Gurucharan Shetty Aug. 2, 2016, 7:17 p.m. UTC | #11

On 2 August 2016 at 12:01, Russell Bryant <russell@ovn.org> wrote:

>
> On Tue, Aug 2, 2016 at 1:29 PM, Guru Shetty <guru@ovn.org> wrote:
>
>> The 2 ct_commit for deletion of firewall rules will likely be tricky. This
>> will need unit tests.
>>
>
> I don't think I understand the concern.  Can you expand a bit on what you
> mean by "2 ct_commit for deletion of firewall rules"?
>

My memory on how ct_commit(ct_label=1) works is a little hazy. There are 2
stages now. So whenever a firewall rule is deleted for an established
connection, the default ct_commit(ct_label=1) will get hit and the
connection is dropped. The same thing happens in the second stage for any
removed firewall rule. In the second stage when a firewall rule is deleted
ct_label is also set which will reflect in the first stage. Does not this
cause confusion with the logic?

>
>
> --
> Russell Bryant
>

Russell Bryant Aug. 2, 2016, 7:27 p.m. UTC | #12

On Tue, Aug 2, 2016 at 3:17 PM, Guru Shetty <guru@ovn.org> wrote:

>
>
> On 2 August 2016 at 12:01, Russell Bryant <russell@ovn.org> wrote:
>
>>
>> On Tue, Aug 2, 2016 at 1:29 PM, Guru Shetty <guru@ovn.org> wrote:
>>
>>> The 2 ct_commit for deletion of firewall rules will likely be tricky.
>>> This
>>> will need unit tests.
>>>
>>
>> I don't think I understand the concern.  Can you expand a bit on what you
>> mean by "2 ct_commit for deletion of firewall rules"?
>>
>
> My memory on how ct_commit(ct_label=1) works is a little hazy. There are 2
> stages now. So whenever a firewall rule is deleted for an established
> connection, the default ct_commit(ct_label=1) will get hit and the
> connection is dropped. The same thing happens in the second stage for any
> removed firewall rule. In the second stage when a firewall rule is deleted
> ct_label is also set which will reflect in the first stage. Does not this
> cause confusion with the logic?
>

Setting ct_label back to 0 only happens in the stateful table.  That
ct_commit will only occur if none of the ACL stages think the packet should
be dropped.  I think it's OK.

Gurucharan Shetty Aug. 2, 2016, 7:35 p.m. UTC | #13

On 2 August 2016 at 12:27, Russell Bryant <russell@ovn.org> wrote:

>
>
> On Tue, Aug 2, 2016 at 3:17 PM, Guru Shetty <guru@ovn.org> wrote:
>
>>
>>
>> On 2 August 2016 at 12:01, Russell Bryant <russell@ovn.org> wrote:
>>
>>>
>>> On Tue, Aug 2, 2016 at 1:29 PM, Guru Shetty <guru@ovn.org> wrote:
>>>
>>>> The 2 ct_commit for deletion of firewall rules will likely be tricky.
>>>> This
>>>> will need unit tests.
>>>>
>>>
>>> I don't think I understand the concern.  Can you expand a bit on what
>>> you mean by "2 ct_commit for deletion of firewall rules"?
>>>
>>
>> My memory on how ct_commit(ct_label=1) works is a little hazy. There are
>> 2 stages now. So whenever a firewall rule is deleted for an established
>> connection, the default ct_commit(ct_label=1) will get hit and the
>> connection is dropped. The same thing happens in the second stage for any
>> removed firewall rule. In the second stage when a firewall rule is deleted
>> ct_label is also set which will reflect in the first stage. Does not this
>> cause confusion with the logic?
>>
>
> Setting ct_label back to 0 only happens in the stateful table.  That
> ct_commit will only occur if none of the ACL stages think the packet should
> be dropped.  I think it's OK.
>

I see. I think we should still consider unit tests now. Userspace datapath
has ct_commit now (it still can't do NAT). That should ideally work. If
that does not work, we should consider adding tests to system-ovn.at




>
> --
> Russell Bryant
>

Russell Bryant Aug. 2, 2016, 8:14 p.m. UTC | #14

On Tue, Aug 2, 2016 at 3:35 PM, Guru Shetty <guru@ovn.org> wrote:

>
>
> On 2 August 2016 at 12:27, Russell Bryant <russell@ovn.org> wrote:
>
>>
>>
>> On Tue, Aug 2, 2016 at 3:17 PM, Guru Shetty <guru@ovn.org> wrote:
>>
>>>
>>>
>>> On 2 August 2016 at 12:01, Russell Bryant <russell@ovn.org> wrote:
>>>
>>>>
>>>> On Tue, Aug 2, 2016 at 1:29 PM, Guru Shetty <guru@ovn.org> wrote:
>>>>
>>>>> The 2 ct_commit for deletion of firewall rules will likely be tricky.
>>>>> This
>>>>> will need unit tests.
>>>>>
>>>>
>>>> I don't think I understand the concern.  Can you expand a bit on what
>>>> you mean by "2 ct_commit for deletion of firewall rules"?
>>>>
>>>
>>> My memory on how ct_commit(ct_label=1) works is a little hazy. There are
>>> 2 stages now. So whenever a firewall rule is deleted for an established
>>> connection, the default ct_commit(ct_label=1) will get hit and the
>>> connection is dropped. The same thing happens in the second stage for any
>>> removed firewall rule. In the second stage when a firewall rule is deleted
>>> ct_label is also set which will reflect in the first stage. Does not this
>>> cause confusion with the logic?
>>>
>>
>> Setting ct_label back to 0 only happens in the stateful table.  That
>> ct_commit will only occur if none of the ACL stages think the packet should
>> be dropped.  I think it's OK.
>>
>
> I see. I think we should still consider unit tests now. Userspace datapath
> has ct_commit now (it still can't do NAT). That should ideally work. If
> that does not work, we should consider adding tests to system-ovn.at
>

Yes, I agree that this area is sorely lacking in test coverage.

Darrell Ball Aug. 2, 2016, 8:39 p.m. UTC | #15

On Tue, Aug 2, 2016 at 12:05 PM, Russell Bryant <russell@ovn.org> wrote:

>
>
> On Tue, Aug 2, 2016 at 3:02 PM, Darrell Ball <dlu998@gmail.com> wrote:
>
>>
>>
>> On Tue, Aug 2, 2016 at 10:23 AM, Mickey Spiegel <mickeys.dev@gmail.com>
>> wrote:
>>
>>> On Tue, Aug 2, 2016 at 9:26 AM, Darrell Ball <dlu998@gmail.com> wrote:
>>>
>>>>
>>>>
>>>> On Tue, Aug 2, 2016 at 4:52 AM, Russell Bryant <russell@ovn.org> wrote:
>>>>
>>>>> On Sat, Jul 30, 2016 at 4:19 PM, Mickey Spiegel <mickeys.dev@gmail.com
>>>>> >
>>>>> wrote:
>>>>>
>>>>> > On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <
>>>>> emspiege@us.ibm.com>
>>>>> > wrote:
>>>>> > >
>>>>> > > -----"dev" <dev-bounces@openvswitch.org> wrote: -----
>>>>> > >> To: Mickey Spiegel <mickeys.dev@gmail.com>
>>>>> > >> From: Russell Bryant
>>>>> > >> Sent by: "dev"
>>>>> > >> Date: 07/29/2016 10:02AM
>>>>> > >> Cc: ovs dev <dev@openvswitch.org>
>>>>> > >> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
>>>>> > >>
>>>>> > >> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <
>>>>> mickeys.dev@gmail.com
>>>>> > >
>>>>> > >> wrote:
>>>>> > >>
>>>>> > >>>
>>>>> > >>> This patch adds a second logical switch ingress ACL stage, and
>>>>> > >>> correspondingly a second logical switch egress ACL stage.  This
>>>>> > >>> allows for more than one ACL-based feature to be applied in the
>>>>> > >>> ingress and egress logical switch pipelines.  The features
>>>>> > >>> driving the different ACL stages may be configured by different
>>>>> > >>> users, for example an application deployer managing security
>>>>> > >>> groups and a network or security admin configuring network ACLs
>>>>> > >>> or firewall rules.
>>>>> > >>>
>>>>> > >>> Each ACL stage is self contained.  The "action" for the
>>>>> > >>> highest-"priority" matching row in an ACL stage determines a
>>>>> > >>> packet's treatment.  A separate "action" will be determined in
>>>>> > >>> each ACL stage, according to the ACL rules configured for that
>>>>> > >>> ACL stage.  The "priority" values are only relevant within the
>>>>> > >>> context of an ACL stage.
>>>>> > >>>
>>>>> > >>> ACL rules that do not specify an ACL stage are applied to the
>>>>> > >>> default "acl" stage.
>>>>> > >>>
>>>>> > >>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
>>>>> > >>
>>>>> > >>
>>>>> > >> Could you expand on why priorities in a single stage aren't
>>>>> enough to
>>>>> > >> satisfy the use case?
>>>>> > >
>>>>> > > If two features are configured independently with a mix of
>>>>> > > prioritized allow and drop rules, then with a single stage, a
>>>>> > > new set of ACL rules must be produced that achieves the same
>>>>> > > behavior.  This is sometimes referred to as an "ACL merge"
>>>>> > > algorithm, for example:
>>>>> > >
>>>>> >
>>>>> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
>>>>> > >
>>>>> > > In the worst case, for example when the features act on different
>>>>> > > packet fields (e.g. one on IP address and another on L4 port),
>>>>> > > the number of rules required can approach
>>>>> > > (# of ACL1 rules) * (# of ACL2 rules).
>>>>> > >
>>>>> > > While it is possible to code up such an algorithm, it adds
>>>>> > > significant complexity and complicates whichever layer
>>>>> > > implements the merge algorithm, either OVN or the CMS above.
>>>>> > >
>>>>> > > By using multiple independent pipeline stages, all of this
>>>>> > > software complexity is avoided, achieving the proper result
>>>>> > > in a simple and straightforward manner.
>>>>> > >
>>>>> > > Recent network hardware ASICs tend to have around 8 or 10 ACL
>>>>> > > stages, though they tend to evaluate these in parallel given
>>>>> > > all the emphasis on low latency these days.
>>>>> >
>>>>> > Throwing in an example to illustrate the difference between one
>>>>> > ACL stage and two ACL stages:
>>>>> >
>>>>> > If two separate ACL stages:
>>>>> > Feature 1
>>>>> > acl  from-lport  100 (tcp == 80) allow-related
>>>>> > acl  from-lport  100 (tcp == 8080) allow-related
>>>>> > acl  from-lport  100 (udp) allow-related
>>>>> > acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related
>>>>> >
>>>>> > Feature 2
>>>>> > acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
>>>>> > acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
>>>>> > acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>>>> > acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>>>> > acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related
>>>>> >
>>>>> > Combined in one stage, to get the equivalent behavior, this would
>>>>> require:
>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80)
>>>>> allow-related
>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080)
>>>>> allow-related
>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src == 10.1.1.0/24
>>>>> &&
>>>>> > tcp) allow-related
>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80)
>>>>> allow-related
>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080)
>>>>> allow-related
>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src ==
>>>>> 10.1.1.0/24 &&
>>>>> > tcp) allow-related
>>>>> > from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>>>> > from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80)
>>>>> allow-related
>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080)
>>>>> allow-related
>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src == 10.1.1.0/24
>>>>> &&
>>>>> > tcp) allow-related
>>>>> >
>>>>>
>>>>> Or have an address set, "addrset1", which contains {172.16.10.0/24,
>>>>> 192.168.20.0/24, 172.16.0.0/20, 192.168.0.0/16, 172.16.0.0/16}.
>>>>>
>>>>> acl  from-lport  100 (ip4.dst == $addrset1 && tcp && tcp.dst == {80,
>>>>> 8080})
>>>>> allow-related
>>>>> acl  from-lport  100 (ip4.dst == $addrset1 && udp) allow-related
>>>>> acl  from-lport  100 (ip4.dst == $addrset1 && ip4.src == 10.1.1.0/24
>>>>> &&
>>>>> tcp) allow-related
>>>>>
>>>>>
>>>>> >
>>>>> > If there are more IP addresses in feature 2, then the number
>>>>> > of ACL rules will climb geometrically:
>>>>> > (4 feature 1 rules * # feature 2 allow-related rules + # feature 2
>>>>> drop
>>>>> > rules)
>>>>> >
>>>>> > With 2 separate ACL stages, the rules just go straight into
>>>>> > the corresponding ACL table, no merge required:
>>>>> > (# feature 1 rules + # feature 2 rules)
>>>>> >
>>>>>
>>>>> Thanks for elaborating.  I'm not opposed.  It seems harmless if not
>>>>> being
>>>>> used.
>>>>>
>>>>
>>>>
>>>> There are presently no unit tests for ACLs in the system tests
>>>> (system-ovn.at).
>>>> The first step should be to add unit tests for single stage ACLs.
>>>> and then add a delta of tests if other stages are desired.
>>>>
>>>> It will be good to test the coordination between multiple stages
>>>> coming directly from northbound APIs and check what happens when
>>>> multistage ACLs are setup and torn down stage by stage, particularly
>>>> when the datapath ends up in a more permissive state for some period of
>>>> time.
>>>>
>>>
>> This feature proposal has a problem for both setup and teardown where
>> the staging will result in a more permissive state for periods of time.
>>
>> Here is a simple example based on your example above:
>> If one only wants to allow TCP and src IP 20.20.20.20 and the stage with
>> TCP is
>> added first with the stage with src IP 20.20.20.20 lagging, one will have
>> the
>> following
>>
>> 200 TCP permit
>> 100 DROP ALL
>>
>> which permits all TCP - not what we want.
>>
>> We cannot enforce a transaction across multiple databases (NB, SB,
>> ovn-controller)
>>
>
> I don't understand this.  Rules for both stages could be added in the same
> transaction.  It's all in the same table of the northbound database.
>
>

I am assuming that the rules would be entered into the Northbound database
in the same
transaction. That part is fine.

However, there is no enforcement of a transaction across multiple databases
in
OVN. So there is no requirement that northd and ovn-controller maintain
that NB DB transaction
across different tables which generating their respective output (i.e. SB
DB and openflow rules).





>
>>
>>
>>>
>>>>
>>>>
>>>>>
>>>>> Can you update the docs to indicate the specific accepted values for
>>>>> "stage"?
>>>>
>>>>
>>>>
>>>> This would significantly complicate the usage of northbound ACL APIs,
>>>> since multi-staging would be exposed at the top (northbound) OVN layer.
>>>>
>>>
>>> The default behavior when "stage" is not specified is to apply the ACL
>>> to the
>>> existing "acl" stage. If you don't care about the second ACL stage,
>>> continue
>>> to use ACLs as you do today and it will work. There is no complication.
>>>
>>
>> You need a set of guidelines.
>> You just cannot assume the northbound API usage will avoid this feature.
>> How does one know this feature should be avoided or when to use it.
>> Assuming one decides to use it, how does one know how to use it.
>>
>>
>>
>>>
>>>
>>>> This would need a clear set of guidelines how northbound
>>>> multistage ACLs would be used by a CMS, at the user level.
>>>>
>>>
>>> The CMS typically does not expose ACLs directly to the user. For example,
>>> with OpenStack, Security Groups use the default "acl" stage. OpenStack
>>> FWaaS v2 would use the "acl2" stage. These are two separate OpenStack
>>> features with separate OpenStack northbound APIs to the user.
>>>
>>
>>
>> First of all, every OVN feature should not be tied to Openstack.]
>>
>
> It was just used as an example of how it would be used ...
>
> --
> Russell Bryant
>

Mickey Spiegel Aug. 2, 2016, 9:38 p.m. UTC | #16

On Tue, Aug 2, 2016 at 1:39 PM, Darrell Ball <dlu998@gmail.com> wrote:

>
>
> On Tue, Aug 2, 2016 at 12:05 PM, Russell Bryant <russell@ovn.org> wrote:
>
>>
>>
>> On Tue, Aug 2, 2016 at 3:02 PM, Darrell Ball <dlu998@gmail.com> wrote:
>>
>>>
>>>
>>> On Tue, Aug 2, 2016 at 10:23 AM, Mickey Spiegel <mickeys.dev@gmail.com>
>>> wrote:
>>>
>>>> On Tue, Aug 2, 2016 at 9:26 AM, Darrell Ball <dlu998@gmail.com> wrote:
>>>>
>>>>>
>>>>>
>>>>> On Tue, Aug 2, 2016 at 4:52 AM, Russell Bryant <russell@ovn.org>
>>>>> wrote:
>>>>>
>>>>>> On Sat, Jul 30, 2016 at 4:19 PM, Mickey Spiegel <
>>>>>> mickeys.dev@gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>> > On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <
>>>>>> emspiege@us.ibm.com>
>>>>>> > wrote:
>>>>>> > >
>>>>>> > > -----"dev" <dev-bounces@openvswitch.org> wrote: -----
>>>>>> > >> To: Mickey Spiegel <mickeys.dev@gmail.com>
>>>>>> > >> From: Russell Bryant
>>>>>> > >> Sent by: "dev"
>>>>>> > >> Date: 07/29/2016 10:02AM
>>>>>> > >> Cc: ovs dev <dev@openvswitch.org>
>>>>>> > >> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
>>>>>> > >>
>>>>>> > >> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <
>>>>>> mickeys.dev@gmail.com
>>>>>> > >
>>>>>> > >> wrote:
>>>>>> > >>
>>>>>> > >>>
>>>>>> > >>> This patch adds a second logical switch ingress ACL stage, and
>>>>>> > >>> correspondingly a second logical switch egress ACL stage.  This
>>>>>> > >>> allows for more than one ACL-based feature to be applied in the
>>>>>> > >>> ingress and egress logical switch pipelines.  The features
>>>>>> > >>> driving the different ACL stages may be configured by different
>>>>>> > >>> users, for example an application deployer managing security
>>>>>> > >>> groups and a network or security admin configuring network ACLs
>>>>>> > >>> or firewall rules.
>>>>>> > >>>
>>>>>> > >>> Each ACL stage is self contained.  The "action" for the
>>>>>> > >>> highest-"priority" matching row in an ACL stage determines a
>>>>>> > >>> packet's treatment.  A separate "action" will be determined in
>>>>>> > >>> each ACL stage, according to the ACL rules configured for that
>>>>>> > >>> ACL stage.  The "priority" values are only relevant within the
>>>>>> > >>> context of an ACL stage.
>>>>>> > >>>
>>>>>> > >>> ACL rules that do not specify an ACL stage are applied to the
>>>>>> > >>> default "acl" stage.
>>>>>> > >>>
>>>>>> > >>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
>>>>>> > >>
>>>>>> > >>
>>>>>> > >> Could you expand on why priorities in a single stage aren't
>>>>>> enough to
>>>>>> > >> satisfy the use case?
>>>>>> > >
>>>>>> > > If two features are configured independently with a mix of
>>>>>> > > prioritized allow and drop rules, then with a single stage, a
>>>>>> > > new set of ACL rules must be produced that achieves the same
>>>>>> > > behavior.  This is sometimes referred to as an "ACL merge"
>>>>>> > > algorithm, for example:
>>>>>> > >
>>>>>> >
>>>>>> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
>>>>>> > >
>>>>>> > > In the worst case, for example when the features act on different
>>>>>> > > packet fields (e.g. one on IP address and another on L4 port),
>>>>>> > > the number of rules required can approach
>>>>>> > > (# of ACL1 rules) * (# of ACL2 rules).
>>>>>> > >
>>>>>> > > While it is possible to code up such an algorithm, it adds
>>>>>> > > significant complexity and complicates whichever layer
>>>>>> > > implements the merge algorithm, either OVN or the CMS above.
>>>>>> > >
>>>>>> > > By using multiple independent pipeline stages, all of this
>>>>>> > > software complexity is avoided, achieving the proper result
>>>>>> > > in a simple and straightforward manner.
>>>>>> > >
>>>>>> > > Recent network hardware ASICs tend to have around 8 or 10 ACL
>>>>>> > > stages, though they tend to evaluate these in parallel given
>>>>>> > > all the emphasis on low latency these days.
>>>>>> >
>>>>>> > Throwing in an example to illustrate the difference between one
>>>>>> > ACL stage and two ACL stages:
>>>>>> >
>>>>>> > If two separate ACL stages:
>>>>>> > Feature 1
>>>>>> > acl  from-lport  100 (tcp == 80) allow-related
>>>>>> > acl  from-lport  100 (tcp == 8080) allow-related
>>>>>> > acl  from-lport  100 (udp) allow-related
>>>>>> > acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related
>>>>>> >
>>>>>> > Feature 2
>>>>>> > acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
>>>>>> > acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
>>>>>> > acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>>>>> > acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>>>>> > acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related
>>>>>> >
>>>>>> > Combined in one stage, to get the equivalent behavior, this would
>>>>>> require:
>>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80)
>>>>>> allow-related
>>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080)
>>>>>> allow-related
>>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
>>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src ==
>>>>>> 10.1.1.0/24 &&
>>>>>> > tcp) allow-related
>>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80)
>>>>>> allow-related
>>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080)
>>>>>> allow-related
>>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
>>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src ==
>>>>>> 10.1.1.0/24 &&
>>>>>> > tcp) allow-related
>>>>>> > from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>>>>> > from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80)
>>>>>> allow-related
>>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080)
>>>>>> allow-related
>>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
>>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src == 10.1.1.0/24
>>>>>> &&
>>>>>> > tcp) allow-related
>>>>>> >
>>>>>>
>>>>>> Or have an address set, "addrset1", which contains {172.16.10.0/24,
>>>>>> 192.168.20.0/24, 172.16.0.0/20, 192.168.0.0/16, 172.16.0.0/16}.
>>>>>>
>>>>>> acl  from-lport  100 (ip4.dst == $addrset1 && tcp && tcp.dst == {80,
>>>>>> 8080})
>>>>>> allow-related
>>>>>> acl  from-lport  100 (ip4.dst == $addrset1 && udp) allow-related
>>>>>> acl  from-lport  100 (ip4.dst == $addrset1 && ip4.src == 10.1.1.0/24
>>>>>> &&
>>>>>> tcp) allow-related
>>>>>>
>>>>>>
>>>>>> >
>>>>>> > If there are more IP addresses in feature 2, then the number
>>>>>> > of ACL rules will climb geometrically:
>>>>>> > (4 feature 1 rules * # feature 2 allow-related rules + # feature 2
>>>>>> drop
>>>>>> > rules)
>>>>>> >
>>>>>> > With 2 separate ACL stages, the rules just go straight into
>>>>>> > the corresponding ACL table, no merge required:
>>>>>> > (# feature 1 rules + # feature 2 rules)
>>>>>> >
>>>>>>
>>>>>> Thanks for elaborating.  I'm not opposed.  It seems harmless if not
>>>>>> being
>>>>>> used.
>>>>>>
>>>>>
>>>>>
>>>>> There are presently no unit tests for ACLs in the system tests
>>>>> (system-ovn.at).
>>>>> The first step should be to add unit tests for single stage ACLs.
>>>>> and then add a delta of tests if other stages are desired.
>>>>>
>>>>> It will be good to test the coordination between multiple stages
>>>>> coming directly from northbound APIs and check what happens when
>>>>> multistage ACLs are setup and torn down stage by stage, particularly
>>>>> when the datapath ends up in a more permissive state for some period
>>>>> of time.
>>>>>
>>>>
>>> This feature proposal has a problem for both setup and teardown where
>>> the staging will result in a more permissive state for periods of time.
>>>
>>> Here is a simple example based on your example above:
>>> If one only wants to allow TCP and src IP 20.20.20.20 and the stage with
>>> TCP is
>>> added first with the stage with src IP 20.20.20.20 lagging, one will
>>> have the
>>> following
>>>
>>> 200 TCP permit
>>> 100 DROP ALL
>>>
>>> which permits all TCP - not what we want.
>>>
>>> We cannot enforce a transaction across multiple databases (NB, SB,
>>> ovn-controller)
>>>
>>
That is not how this is meant to be used. I used one stage for IP addresses
and
one stage for L4 port as a worst case example of expansion due to ACL merge.
That is not the motivation for using two stages. The motivation is for two
different
features that are configured separately, with one example being OpenStack
Security Groups versus OpenStack FWaaS v2, another example being Security
Groups versus Network ACLs as in a rather popular public cloud.

If you have correlated intent, with TCP and src IP 20.20.20.20 belonging
together,
they should absolutely be put together in one rule in one common ACL stage.

I don't understand this.  Rules for both stages could be added in the same
>> transaction.  It's all in the same table of the northbound database.
>>
>>
>
> I am assuming that the rules would be entered into the Northbound database
> in the same
> transaction. That part is fine.
>
> However, there is no enforcement of a transaction across multiple
> databases in
> OVN. So there is no requirement that northd and ovn-controller maintain
> that NB DB transaction
> across different tables which generating their respective output (i.e. SB
> DB and openflow rules).
>
>
>
>
>
>>
>>>
>>>
>>>>
>>>>>
>>>>>
>>>>>>
>>>>>> Can you update the docs to indicate the specific accepted values for
>>>>>> "stage"?
>>>>>
>>>>>
>>>>>
>>>>> This would significantly complicate the usage of northbound ACL APIs,
>>>>> since multi-staging would be exposed at the top (northbound) OVN layer.
>>>>>
>>>>
>>>> The default behavior when "stage" is not specified is to apply the ACL
>>>> to the
>>>> existing "acl" stage. If you don't care about the second ACL stage,
>>>> continue
>>>> to use ACLs as you do today and it will work. There is no complication.
>>>>
>>>
>>> You need a set of guidelines.
>>> You just cannot assume the northbound API usage will avoid this feature.
>>> How does one know this feature should be avoided or when to use it.
>>> Assuming one decides to use it, how does one know how to use it.
>>>
>>
If you are exposing the OVN northbound API directly, then you have two
options:
1. Hide stage, and just program everything in the default "acl" stage.
2. Expose stage and try to explain how it works.

Hardware switches have had multiple ACL tables for many many years.
As far as I can remember, they are always used for different features that
are configured separately:
Security based on VLANs
Security based on L3 interface
QoS
Service Function Chaining
Control Plane Protection

This would need a clear set of guidelines how northbound
>>>>> multistage ACLs would be used by a CMS, at the user level.
>>>>>
>>>>
>>>> The CMS typically does not expose ACLs directly to the user. For
>>>> example,
>>>> with OpenStack, Security Groups use the default "acl" stage. OpenStack
>>>> FWaaS v2 would use the "acl2" stage. These are two separate OpenStack
>>>> features with separate OpenStack northbound APIs to the user.
>>>>
>>>
>>>
>>> First of all, every OVN feature should not be tied to Openstack.]
>>>
>>
>> It was just used as an example of how it would be used ...
>>
>
As I said above and Russell reiterated, OpenStack FWaaS is just one example.
That is why I went with the generic stage names of "acl" and "acl2" rather
than something like "fw".

Mickey


> --
>> Russell Bryant
>>
>
>

Darrell Ball Aug. 2, 2016, 10:38 p.m. UTC | #17

On Tue, Aug 2, 2016 at 2:38 PM, Mickey Spiegel <mickeys.dev@gmail.com>
wrote:

> On Tue, Aug 2, 2016 at 1:39 PM, Darrell Ball <dlu998@gmail.com> wrote:
>
>>
>>
>> On Tue, Aug 2, 2016 at 12:05 PM, Russell Bryant <russell@ovn.org> wrote:
>>
>>>
>>>
>>> On Tue, Aug 2, 2016 at 3:02 PM, Darrell Ball <dlu998@gmail.com> wrote:
>>>
>>>>
>>>>
>>>> On Tue, Aug 2, 2016 at 10:23 AM, Mickey Spiegel <mickeys.dev@gmail.com>
>>>> wrote:
>>>>
>>>>> On Tue, Aug 2, 2016 at 9:26 AM, Darrell Ball <dlu998@gmail.com> wrote:
>>>>>
>>>>>>
>>>>>>
>>>>>> On Tue, Aug 2, 2016 at 4:52 AM, Russell Bryant <russell@ovn.org>
>>>>>> wrote:
>>>>>>
>>>>>>> On Sat, Jul 30, 2016 at 4:19 PM, Mickey Spiegel <
>>>>>>> mickeys.dev@gmail.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>> > On Fri, Jul 29, 2016 at 10:28 AM, Mickey Spiegel <
>>>>>>> emspiege@us.ibm.com>
>>>>>>> > wrote:
>>>>>>> > >
>>>>>>> > > -----"dev" <dev-bounces@openvswitch.org> wrote: -----
>>>>>>> > >> To: Mickey Spiegel <mickeys.dev@gmail.com>
>>>>>>> > >> From: Russell Bryant
>>>>>>> > >> Sent by: "dev"
>>>>>>> > >> Date: 07/29/2016 10:02AM
>>>>>>> > >> Cc: ovs dev <dev@openvswitch.org>
>>>>>>> > >> Subject: Re: [ovs-dev] [PATCH] ovn: Add second ACL stage
>>>>>>> > >>
>>>>>>> > >> On Fri, Jul 29, 2016 at 12:47 AM, Mickey Spiegel <
>>>>>>> mickeys.dev@gmail.com
>>>>>>> > >
>>>>>>> > >> wrote:
>>>>>>> > >>
>>>>>>> > >>>
>>>>>>> > >>> This patch adds a second logical switch ingress ACL stage, and
>>>>>>> > >>> correspondingly a second logical switch egress ACL stage.  This
>>>>>>> > >>> allows for more than one ACL-based feature to be applied in the
>>>>>>> > >>> ingress and egress logical switch pipelines.  The features
>>>>>>> > >>> driving the different ACL stages may be configured by different
>>>>>>> > >>> users, for example an application deployer managing security
>>>>>>> > >>> groups and a network or security admin configuring network ACLs
>>>>>>> > >>> or firewall rules.
>>>>>>> > >>>
>>>>>>> > >>> Each ACL stage is self contained.  The "action" for the
>>>>>>> > >>> highest-"priority" matching row in an ACL stage determines a
>>>>>>> > >>> packet's treatment.  A separate "action" will be determined in
>>>>>>> > >>> each ACL stage, according to the ACL rules configured for that
>>>>>>> > >>> ACL stage.  The "priority" values are only relevant within the
>>>>>>> > >>> context of an ACL stage.
>>>>>>> > >>>
>>>>>>> > >>> ACL rules that do not specify an ACL stage are applied to the
>>>>>>> > >>> default "acl" stage.
>>>>>>> > >>>
>>>>>>> > >>> Signed-off-by: Mickey Spiegel <mickeys.dev@gmail.com>
>>>>>>> > >>
>>>>>>> > >>
>>>>>>> > >> Could you expand on why priorities in a single stage aren't
>>>>>>> enough to
>>>>>>> > >> satisfy the use case?
>>>>>>> > >
>>>>>>> > > If two features are configured independently with a mix of
>>>>>>> > > prioritized allow and drop rules, then with a single stage, a
>>>>>>> > > new set of ACL rules must be produced that achieves the same
>>>>>>> > > behavior.  This is sometimes referred to as an "ACL merge"
>>>>>>> > > algorithm, for example:
>>>>>>> > >
>>>>>>> >
>>>>>>> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
>>>>>>> > >
>>>>>>> > > In the worst case, for example when the features act on different
>>>>>>> > > packet fields (e.g. one on IP address and another on L4 port),
>>>>>>> > > the number of rules required can approach
>>>>>>> > > (# of ACL1 rules) * (# of ACL2 rules).
>>>>>>> > >
>>>>>>> > > While it is possible to code up such an algorithm, it adds
>>>>>>> > > significant complexity and complicates whichever layer
>>>>>>> > > implements the merge algorithm, either OVN or the CMS above.
>>>>>>> > >
>>>>>>> > > By using multiple independent pipeline stages, all of this
>>>>>>> > > software complexity is avoided, achieving the proper result
>>>>>>> > > in a simple and straightforward manner.
>>>>>>> > >
>>>>>>> > > Recent network hardware ASICs tend to have around 8 or 10 ACL
>>>>>>> > > stages, though they tend to evaluate these in parallel given
>>>>>>> > > all the emphasis on low latency these days.
>>>>>>> >
>>>>>>> > Throwing in an example to illustrate the difference between one
>>>>>>> > ACL stage and two ACL stages:
>>>>>>> >
>>>>>>> > If two separate ACL stages:
>>>>>>> > Feature 1
>>>>>>> > acl  from-lport  100 (tcp == 80) allow-related
>>>>>>> > acl  from-lport  100 (tcp == 8080) allow-related
>>>>>>> > acl  from-lport  100 (udp) allow-related
>>>>>>> > acl  from-lport  100 (ip4.src == 10.1.1.0/24 && tcp) allow-related
>>>>>>> >
>>>>>>> > Feature 2
>>>>>>> > acl2 from-lport  300 (ip4.dst == 172.16.10.0/24) allow-related
>>>>>>> > acl2 from-lport  300 (ip4.dst == 192.168.20.0/24) allow-related
>>>>>>> > acl2 from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>>>>>> > acl2 from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>>>>>> > acl2 from-lport  100 (ip4.dst == 172.16.0.0/16) allow-related
>>>>>>> >
>>>>>>> > Combined in one stage, to get the equivalent behavior, this would
>>>>>>> require:
>>>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 80)
>>>>>>> allow-related
>>>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && tcp == 8080)
>>>>>>> allow-related
>>>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && udp) allow-related
>>>>>>> > from-lport  300 (ip4.dst == 172.16.10.0/24 && ip4.src ==
>>>>>>> 10.1.1.0/24 &&
>>>>>>> > tcp) allow-related
>>>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 80)
>>>>>>> allow-related
>>>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && tcp == 8080)
>>>>>>> allow-related
>>>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && udp) allow-related
>>>>>>> > from-lport  300 (ip4.dst == 192.168.20.0/24 && ip4.src ==
>>>>>>> 10.1.1.0/24 &&
>>>>>>> > tcp) allow-related
>>>>>>> > from-lport  200 (ip4.dst == 172.16.0.0/20) drop
>>>>>>> > from-lport  200 (ip4.dst == 192.168.0.0/16) drop
>>>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 80)
>>>>>>> allow-related
>>>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && tcp == 8080)
>>>>>>> allow-related
>>>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && udp) allow-related
>>>>>>> > from-lport  100 (ip4.dst == 172.16.0.0/16 && ip4.src ==
>>>>>>> 10.1.1.0/24 &&
>>>>>>> > tcp) allow-related
>>>>>>> >
>>>>>>>
>>>>>>> Or have an address set, "addrset1", which contains {172.16.10.0/24,
>>>>>>> 192.168.20.0/24, 172.16.0.0/20, 192.168.0.0/16, 172.16.0.0/16}.
>>>>>>>
>>>>>>> acl  from-lport  100 (ip4.dst == $addrset1 && tcp && tcp.dst == {80,
>>>>>>> 8080})
>>>>>>> allow-related
>>>>>>> acl  from-lport  100 (ip4.dst == $addrset1 && udp) allow-related
>>>>>>> acl  from-lport  100 (ip4.dst == $addrset1 && ip4.src == 10.1.1.0/24
>>>>>>> &&
>>>>>>> tcp) allow-related
>>>>>>>
>>>>>>>
>>>>>>> >
>>>>>>> > If there are more IP addresses in feature 2, then the number
>>>>>>> > of ACL rules will climb geometrically:
>>>>>>> > (4 feature 1 rules * # feature 2 allow-related rules + # feature 2
>>>>>>> drop
>>>>>>> > rules)
>>>>>>> >
>>>>>>> > With 2 separate ACL stages, the rules just go straight into
>>>>>>> > the corresponding ACL table, no merge required:
>>>>>>> > (# feature 1 rules + # feature 2 rules)
>>>>>>> >
>>>>>>>
>>>>>>> Thanks for elaborating.  I'm not opposed.  It seems harmless if not
>>>>>>> being
>>>>>>> used.
>>>>>>>
>>>>>>
>>>>>>
>>>>>> There are presently no unit tests for ACLs in the system tests
>>>>>> (system-ovn.at).
>>>>>> The first step should be to add unit tests for single stage ACLs.
>>>>>> and then add a delta of tests if other stages are desired.
>>>>>>
>>>>>> It will be good to test the coordination between multiple stages
>>>>>> coming directly from northbound APIs and check what happens when
>>>>>> multistage ACLs are setup and torn down stage by stage, particularly
>>>>>> when the datapath ends up in a more permissive state for some period
>>>>>> of time.
>>>>>>
>>>>>
>>>> This feature proposal has a problem for both setup and teardown where
>>>> the staging will result in a more permissive state for periods of time.
>>>>
>>>> Here is a simple example based on your example above:
>>>> If one only wants to allow TCP and src IP 20.20.20.20 and the stage
>>>> with TCP is
>>>> added first with the stage with src IP 20.20.20.20 lagging, one will
>>>> have the
>>>> following
>>>>
>>>> 200 TCP permit
>>>> 100 DROP ALL
>>>>
>>>> which permits all TCP - not what we want.
>>>>
>>>> We cannot enforce a transaction across multiple databases (NB, SB,
>>>> ovn-controller)
>>>>
>>>
> That is not how this is meant to be used. I used one stage for IP
> addresses and
> one stage for L4 port as a worst case example of expansion due to ACL
> merge.
> That is not the motivation for using two stages. The motivation is for two
> different
> features that are configured separately, with one example being OpenStack
> Security Groups versus OpenStack FWaaS v2, another example being Security
> Groups versus Network ACLs as in a rather popular public cloud.
>
> If you have correlated intent, with TCP and src IP 20.20.20.20 belonging
> together,
> they should absolutely be put together in one rule in one common ACL stage.
>

Good - then this is part of the "OVN documentation" aspect I mentioned.
A CMS, such as Openstack or something else would need to know the details
of what is the recommended usage of NB APIs. Openstack is the user/client
in this case.



>
> I don't understand this.  Rules for both stages could be added in the same
>>> transaction.  It's all in the same table of the northbound database.
>>>
>>>
>>
>> I am assuming that the rules would be entered into the Northbound
>> database in the same
>> transaction. That part is fine.
>>
>> However, there is no enforcement of a transaction across multiple
>> databases in
>> OVN. So there is no requirement that northd and ovn-controller maintain
>> that NB DB transaction
>> across different tables which generating their respective output (i.e. SB
>> DB and openflow rules).
>>
>>
>>
>>
>>
>>>
>>>>
>>>>
>>>>>
>>>>>>
>>>>>>
>>>>>>>
>>>>>>> Can you update the docs to indicate the specific accepted values for
>>>>>>> "stage"?
>>>>>>
>>>>>>
>>>>>>
>>>>>> This would significantly complicate the usage of northbound ACL APIs,
>>>>>> since multi-staging would be exposed at the top (northbound) OVN
>>>>>> layer.
>>>>>>
>>>>>
>>>>> The default behavior when "stage" is not specified is to apply the ACL
>>>>> to the
>>>>> existing "acl" stage. If you don't care about the second ACL stage,
>>>>> continue
>>>>> to use ACLs as you do today and it will work. There is no complication.
>>>>>
>>>>
>>>> You need a set of guidelines.
>>>> You just cannot assume the northbound API usage will avoid this feature.
>>>> How does one know this feature should be avoided or when to use it.
>>>> Assuming one decides to use it, how does one know how to use it.
>>>>
>>>
> If you are exposing the OVN northbound API directly, then you have two
> options:
> 1. Hide stage, and just program everything in the default "acl" stage.
> 2. Expose stage and try to explain how it works.
>

I see ACL stages defined in the NB schema in this proposed patch.
So, this patch exposes stages to OVN clients, of which Openstack is just one
such possible client or "user of OVN".

Some tests along with use case documentation/recommedations is needed.



>
> Hardware switches have had multiple ACL tables for many many years.
> As far as I can remember, they are always used for different features that
> are configured separately:
> Security based on VLANs
> Security based on L3 interface
> QoS
> Service Function Chaining
> Control Plane Protection
>


There are multiple match/action capabilities in HW of different types.
Hardware switches and routers typically don't expose implementation details,
such as number of ACL stages, type of hardware resource etc at the
northbound
interface.




>
> This would need a clear set of guidelines how northbound
>>>>>> multistage ACLs would be used by a CMS, at the user level.
>>>>>>
>>>>>
>>>>> The CMS typically does not expose ACLs directly to the user. For
>>>>> example,
>>>>> with OpenStack, Security Groups use the default "acl" stage. OpenStack
>>>>> FWaaS v2 would use the "acl2" stage. These are two separate OpenStack
>>>>> features with separate OpenStack northbound APIs to the user.
>>>>>
>>>>
>>>>
>>>> First of all, every OVN feature should not be tied to Openstack.]
>>>>
>>>
>>> It was just used as an example of how it would be used ...
>>>
>>
> As I said above and Russell reiterated, OpenStack FWaaS is just one
> example.
> That is why I went with the generic stage names of "acl" and "acl2" rather
> than something like "fw".
>


Alright, perhaps some tests and use case documentation will be useful here.



>
> Mickey
>
>
>> --
>>> Russell Bryant
>>>
>>
>>
>

Ben Pfaff Aug. 14, 2016, 5:02 a.m. UTC | #18

On Fri, Jul 29, 2016 at 05:28:26PM +0000, Mickey Spiegel wrote:
> Could you expand on why priorities in a single stage aren't enough to
> satisfy the use case?
> 
> <Mickey>
> If two features are configured independently with a mix of
> prioritized allow and drop rules, then with a single stage, a
> new set of ACL rules must be produced that achieves the same
> behavior.  This is sometimes referred to as an "ACL merge"
> algorithm, for example:
> http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_paper09186a00800c9470.shtml#wp39514
> 
> In the worst case, for example when the features act on different
> packet fields (e.g. one on IP address and another on L4 port),
> the number of rules required can approach
> (# of ACL1 rules) * (# of ACL2 rules).
> 
> While it is possible to code up such an algorithm, it adds
> significant complexity and complicates whichever layer
> implements the merge algorithm, either OVN or the CMS above.
> 
> By using multiple independent pipeline stages, all of this
> software complexity is avoided, achieving the proper result
> in a simple and straightforward manner.
> 
> Recent network hardware ASICs tend to have around 8 or 10 ACL
> stages, though they tend to evaluate these in parallel given
> all the emphasis on low latency these days.

I guess that, in software, if there's a need for 2 of something, there's
usually a need for N of it, so I'd tend to prefer that instead of
hard-coding 2 stages of ACLs, we make N of them available (for perhaps N
== 8), especially given that you say hardware tends to work that way.
It's not really more expensive for OVS, and definitely not if only a few
of them are used.  We might need to expand the number of logical tables,
since currently there are only 16 ingress tables and 16 egress tables,
but doubling them to 32 each wouldn't be a big deal.

Mickey Spiegel Aug. 14, 2016, 10:21 p.m. UTC | #19

On Sat, Aug 13, 2016 at 10:02 PM, Ben Pfaff <blp@ovn.org> wrote:

> On Fri, Jul 29, 2016 at 05:28:26PM +0000, Mickey Spiegel wrote:
> > Could you expand on why priorities in a single stage aren't enough to
> > satisfy the use case?
> >
> > <Mickey>
> > If two features are configured independently with a mix of
> > prioritized allow and drop rules, then with a single stage, a
> > new set of ACL rules must be produced that achieves the same
> > behavior.  This is sometimes referred to as an "ACL merge"
> > algorithm, for example:
> > http://www.cisco.com/en/US/products/hw/switches/ps708/products_white_
> paper09186a00800c9470.shtml#wp39514
> >
> > In the worst case, for example when the features act on different
> > packet fields (e.g. one on IP address and another on L4 port),
> > the number of rules required can approach
> > (# of ACL1 rules) * (# of ACL2 rules).
> >
> > While it is possible to code up such an algorithm, it adds
> > significant complexity and complicates whichever layer
> > implements the merge algorithm, either OVN or the CMS above.
> >
> > By using multiple independent pipeline stages, all of this
> > software complexity is avoided, achieving the proper result
> > in a simple and straightforward manner.
> >
> > Recent network hardware ASICs tend to have around 8 or 10 ACL
> > stages, though they tend to evaluate these in parallel given
> > all the emphasis on low latency these days.
>
> I guess that, in software, if there's a need for 2 of something, there's
> usually a need for N of it, so I'd tend to prefer that instead of
> hard-coding 2 stages of ACLs, we make N of them available (for perhaps N
> == 8), especially given that you say hardware tends to work that way.
> It's not really more expensive for OVS, and definitely not if only a few
> of them are used.  We might need to expand the number of logical tables,
> since currently there are only 16 ingress tables and 16 egress tables,
> but doubling them to 32 each wouldn't be a big deal.
>

I did try to code the core part of the changes so that more ACL stages
could be easily added in the future, but the code having to do with
definition of the pipeline stages, associated functions, and nbctl is only
coded for 2 stages at the moment. Let me think about the best way to
generalize this.

As far as need and usage, I guess the key question is whether features
such as service function chaining and QoS marking will use generic ACL
stages, or pipeline stages specifically coded for those features?
In hardware switches, those type of features use many of the multiple
ACL stages.

The way I coded the patch, the fixed rules allowing and dropping
certain flows regardless of user-defined ACL rules are duplicated in
each ACL stage. However, I am not sure if those rules are necessary
or make sense if the actions for that pipeline stage are redirect (for SFC)
or QoS marking, rather than allow and drop. I need to think about it.

I have moved on to other things temporarily, will come back to this patch
if/when I have time to work on ACL tests, or if someone else adds ACL
tests.

Mickey

> _______________________________________________
> dev mailing list
> dev@openvswitch.org
> http://openvswitch.org/mailman/listinfo/dev
>

[ovs-dev] ovn: Add second ACL stage

Commit Message

Comments

Patch