diff mbox

[RFC,2/9] opp-modifier: Add opp-modifier-reg driver

Message ID 1394825135-60110-3-git-send-email-d-gerlach@ti.com
State Superseded, archived
Headers show

Commit Message

Dave Gerlach March 14, 2014, 7:25 p.m. UTC
Driver to read from a register and depending on either set bits or
a specific known selectively enable or disable OPPs based on DT node.

Can support opp-modifier-reg-bit where single bits within the register
determine the availability of an OPP or opp-modifier-reg-val where a
certain value inside the register or a portion of it determine what the
maximum allowed OPP is.

The driver expects a device that has already has its OPPs loaded
and then will disable the OPPs not matching the criteria specified in
the opp-modifier table.

Signed-off-by: Dave Gerlach <d-gerlach@ti.com>
---
 .../devicetree/bindings/power/opp-modifier.txt     | 111 +++++++++
 drivers/power/opp/Makefile                         |   1 +
 drivers/power/opp/opp-modifier-reg.c               | 259 +++++++++++++++++++++
 3 files changed, 371 insertions(+)
 create mode 100644 Documentation/devicetree/bindings/power/opp-modifier.txt
 create mode 100644 drivers/power/opp/opp-modifier-reg.c

Comments

Rob Herring March 14, 2014, 9 p.m. UTC | #1
On Fri, Mar 14, 2014 at 2:25 PM, Dave Gerlach <d-gerlach@ti.com> wrote:
> Driver to read from a register and depending on either set bits or
> a specific known selectively enable or disable OPPs based on DT node.
>
> Can support opp-modifier-reg-bit where single bits within the register
> determine the availability of an OPP or opp-modifier-reg-val where a
> certain value inside the register or a portion of it determine what the
> maximum allowed OPP is.
>
> The driver expects a device that has already has its OPPs loaded
> and then will disable the OPPs not matching the criteria specified in
> the opp-modifier table.
>
> Signed-off-by: Dave Gerlach <d-gerlach@ti.com>
> ---
>  .../devicetree/bindings/power/opp-modifier.txt     | 111 +++++++++
>  drivers/power/opp/Makefile                         |   1 +
>  drivers/power/opp/opp-modifier-reg.c               | 259 +++++++++++++++++++++
>  3 files changed, 371 insertions(+)
>  create mode 100644 Documentation/devicetree/bindings/power/opp-modifier.txt
>  create mode 100644 drivers/power/opp/opp-modifier-reg.c
>
> diff --git a/Documentation/devicetree/bindings/power/opp-modifier.txt b/Documentation/devicetree/bindings/power/opp-modifier.txt
> new file mode 100644
> index 0000000..af8a2e9
> --- /dev/null
> +++ b/Documentation/devicetree/bindings/power/opp-modifier.txt
> @@ -0,0 +1,111 @@
> +* OPP-Modifier - opp modifier to selectively enable operating points
> +
> +Many SoCs that have selectively modifiable OPPs can specify
> +all available OPPs in their operating-points listing and then define
> +opp_modifiers to enable or disable the OPPs that are actually available
> +on the specific hardware.
> +
> +* OPP Modifier Provider

Uggg. Please stop designing around the current OPP binding which has
the problem that the OPP table is not extensible to add more data.
Define a new OPP binding that solves these problems. This is at least
the 3rd OPP related binding addition I've seen recently. But I
wouldn't spend a lot of effort on a new OPP binding just to add the
functionality you are adding here because I don't like the whole
concept in general. This might be a common way to determine valid OPPs
on TI chips, but I think it is too low level and I don't want to see
bindings for every different possible way. Just add platform code to
do the OPP setup you need.

Frankly, I prefer the bootloader/firmware fixup the OPP table approach
mentioned in the cpufreq-cpu0 thread. Somewhat less desirable, but the
kernel could do the fixups as well (via of_update_property).

Rob
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Nishanth Menon March 17, 2014, 2:30 p.m. UTC | #2
On 03/14/2014 04:00 PM, Rob Herring wrote:
> On Fri, Mar 14, 2014 at 2:25 PM, Dave Gerlach <d-gerlach@ti.com> wrote:
>> Driver to read from a register and depending on either set bits or
>> a specific known selectively enable or disable OPPs based on DT node.
>>
>> Can support opp-modifier-reg-bit where single bits within the register
>> determine the availability of an OPP or opp-modifier-reg-val where a
>> certain value inside the register or a portion of it determine what the
>> maximum allowed OPP is.
>>
>> The driver expects a device that has already has its OPPs loaded
>> and then will disable the OPPs not matching the criteria specified in
>> the opp-modifier table.
>>
>> Signed-off-by: Dave Gerlach <d-gerlach@ti.com>
>> ---
>>  .../devicetree/bindings/power/opp-modifier.txt     | 111 +++++++++
>>  drivers/power/opp/Makefile                         |   1 +
>>  drivers/power/opp/opp-modifier-reg.c               | 259 +++++++++++++++++++++
>>  3 files changed, 371 insertions(+)
>>  create mode 100644 Documentation/devicetree/bindings/power/opp-modifier.txt
>>  create mode 100644 drivers/power/opp/opp-modifier-reg.c
>>
>> diff --git a/Documentation/devicetree/bindings/power/opp-modifier.txt b/Documentation/devicetree/bindings/power/opp-modifier.txt
>> new file mode 100644
>> index 0000000..af8a2e9
>> --- /dev/null
>> +++ b/Documentation/devicetree/bindings/power/opp-modifier.txt
>> @@ -0,0 +1,111 @@
>> +* OPP-Modifier - opp modifier to selectively enable operating points
>> +
>> +Many SoCs that have selectively modifiable OPPs can specify
>> +all available OPPs in their operating-points listing and then define
>> +opp_modifiers to enable or disable the OPPs that are actually available
>> +on the specific hardware.
>> +
>> +* OPP Modifier Provider
> 
> Uggg. Please stop designing around the current OPP binding which has
> the problem that the OPP table is not extensible to add more data.
> Define a new OPP binding that solves these problems. This is at least
Generically, there are three different issues with current OPP bindings:
a) ability to enable disable certain OPPs depending on SoC OTP/Efuse
settings.
b) ability to reuse OPPs defined for one device node for another (cpu1
to reuse OPP definitions of cpu0)
c) ability to add additional information per OPP. we can argue this is
a superset of (a), but really, the problems are different.

Previous proposals include making each OPP as a phandle, but there
does not seem much traction in that direction either. - proposal here
has nothing to do with (b) or (c).

> the 3rd OPP related binding addition I've seen recently. But I
> wouldn't spend a lot of effort on a new OPP binding just to add the
> functionality you are adding here because I don't like the whole
> concept in general. This might be a common way to determine valid OPPs
> on TI chips, but I think it is too low level and I don't want to see

Not just TI chips, but iMX, now, Marvell, Xilinx as well. potentially
more as well. doing OTP/Efuse based decision on which OPPs are valid
on a chip is not a TI specific thing. This was the reason for us to
try to define something generic enough to be reused by more SoCs than
just TI.

> bindings for every different possible way. Just add platform code to
> do the OPP setup you need.
Errr.. adding platform code means the hardware description goes back
to kernel. is'nt that giving up on device tree binding for describing
hardware?

> 
> Frankly, I prefer the bootloader/firmware fixup the OPP table approach
> mentioned in the cpufreq-cpu0 thread. Somewhat less desirable, but the
> kernel could do the fixups as well (via of_update_property).

a) Trying to move the hardware definition away from device tree seems
to me a major step backwards.
b) Allowing for definitions in platform code is a step backwards again
for a generic solution that works for more than 1 vendor.
c) moving the logic away to bootloader when it can easily be done in
kernel again is adding burden to bootloader for data it does need to
handle.

OPP is a hardware behavior, which OPPs are enabled are described in
hardware on certain SoCs. the current proposal is to provide a generic
solution for those devices that allow for dynamic definition of OPPs
based on SoC efuse definition.
Rob Herring March 17, 2014, 6:37 p.m. UTC | #3
On Mon, Mar 17, 2014 at 9:30 AM, Nishanth Menon <nm@ti.com> wrote:
> On 03/14/2014 04:00 PM, Rob Herring wrote:
>> On Fri, Mar 14, 2014 at 2:25 PM, Dave Gerlach <d-gerlach@ti.com> wrote:
>>> Driver to read from a register and depending on either set bits or
>>> a specific known selectively enable or disable OPPs based on DT node.
>>>
>>> Can support opp-modifier-reg-bit where single bits within the register
>>> determine the availability of an OPP or opp-modifier-reg-val where a
>>> certain value inside the register or a portion of it determine what the
>>> maximum allowed OPP is.
>>>
>>> The driver expects a device that has already has its OPPs loaded
>>> and then will disable the OPPs not matching the criteria specified in
>>> the opp-modifier table.
>>>
>>> Signed-off-by: Dave Gerlach <d-gerlach@ti.com>
>>> ---
>>>  .../devicetree/bindings/power/opp-modifier.txt     | 111 +++++++++
>>>  drivers/power/opp/Makefile                         |   1 +
>>>  drivers/power/opp/opp-modifier-reg.c               | 259 +++++++++++++++++++++
>>>  3 files changed, 371 insertions(+)
>>>  create mode 100644 Documentation/devicetree/bindings/power/opp-modifier.txt
>>>  create mode 100644 drivers/power/opp/opp-modifier-reg.c
>>>
>>> diff --git a/Documentation/devicetree/bindings/power/opp-modifier.txt b/Documentation/devicetree/bindings/power/opp-modifier.txt
>>> new file mode 100644
>>> index 0000000..af8a2e9
>>> --- /dev/null
>>> +++ b/Documentation/devicetree/bindings/power/opp-modifier.txt
>>> @@ -0,0 +1,111 @@
>>> +* OPP-Modifier - opp modifier to selectively enable operating points
>>> +
>>> +Many SoCs that have selectively modifiable OPPs can specify
>>> +all available OPPs in their operating-points listing and then define
>>> +opp_modifiers to enable or disable the OPPs that are actually available
>>> +on the specific hardware.
>>> +
>>> +* OPP Modifier Provider
>>
>> Uggg. Please stop designing around the current OPP binding which has
>> the problem that the OPP table is not extensible to add more data.
>> Define a new OPP binding that solves these problems. This is at least
> Generically, there are three different issues with current OPP bindings:
> a) ability to enable disable certain OPPs depending on SoC OTP/Efuse
> settings.

More generically: ...depending on variety of factors.

> b) ability to reuse OPPs defined for one device node for another (cpu1
> to reuse OPP definitions of cpu0)
> c) ability to add additional information per OPP. we can argue this is
> a superset of (a), but really, the problems are different.

It is all additional data per OPP. Additional different information is
of course for different problems. That doesn't mean we need different
solutions.

> Previous proposals include making each OPP as a phandle, but there
> does not seem much traction in that direction either. - proposal here
> has nothing to do with (b) or (c).

They may have nothing to do with each other, but they all have to do
with the OPP binding. If we're going to change/extend the binding,
then all issues need to be taken into account.

>> the 3rd OPP related binding addition I've seen recently. But I
>> wouldn't spend a lot of effort on a new OPP binding just to add the
>> functionality you are adding here because I don't like the whole
>> concept in general. This might be a common way to determine valid OPPs
>> on TI chips, but I think it is too low level and I don't want to see
>
> Not just TI chips, but iMX, now, Marvell, Xilinx as well. potentially
> more as well. doing OTP/Efuse based decision on which OPPs are valid
> on a chip is not a TI specific thing. This was the reason for us to
> try to define something generic enough to be reused by more SoCs than
> just TI.

Agreed, but I'm not convinced how different SOCs determine valid OPPs
is common enough. Certainly how to mark an entry disabled is common
though.

>> bindings for every different possible way. Just add platform code to
>> do the OPP setup you need.
> Errr.. adding platform code means the hardware description goes back
> to kernel. is'nt that giving up on device tree binding for describing
> hardware?

We're always going to have some platform code. I'm not saying you have
to in this case. I'm saying either come up with an OPP binding
addressing all these issues or live with the existing one and fix it
up in the kernel or bootloader.

>> Frankly, I prefer the bootloader/firmware fixup the OPP table approach
>> mentioned in the cpufreq-cpu0 thread. Somewhat less desirable, but the
>> kernel could do the fixups as well (via of_update_property).
>
> a) Trying to move the hardware definition away from device tree seems
> to me a major step backwards.
> b) Allowing for definitions in platform code is a step backwards again
> for a generic solution that works for more than 1 vendor.
> c) moving the logic away to bootloader when it can easily be done in
> kernel again is adding burden to bootloader for data it does need to
> handle.

The burden has to be somewhere. Maintaining a binding forever in the
kernel is a burden as well if it is poorly designed.

Valid OPPs are not going to just be random. There's probably on a few
combinations and they'll be based on part# or speed grade or something
(which in turn defines the efuses in your case). While a dev board may
have random parts on it, an actual product would not. I could argue
that your DTB just needs to be correct to begin with for a given
part/design. Obviously, managing minor differences in a DTB like this
can be a pain. This is why firmware or bootloaders do adjustments to
the DTB at runtime and it is quite common.

> OPP is a hardware behavior, which OPPs are enabled are described in
> hardware on certain SoCs. the current proposal is to provide a generic
> solution for those devices that allow for dynamic definition of OPPs
> based on SoC efuse definition.

What if the decision is not based on a single register bit? Perhaps
efuses are not directly memory mapped. Maybe it is based on Si
revision. Or you need to limit frequency because a certain board can't
supply adequate current. You call this generic, but it is not. It
doesn't even solve the part that is generic which is marking some OPPs
disabled.

Rob
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Nishanth Menon March 18, 2014, 3:36 p.m. UTC | #4
On 03/17/2014 01:37 PM, Rob Herring wrote:
> On Mon, Mar 17, 2014 at 9:30 AM, Nishanth Menon <nm@ti.com> wrote:
>> On 03/14/2014 04:00 PM, Rob Herring wrote:
>>> On Fri, Mar 14, 2014 at 2:25 PM, Dave Gerlach <d-gerlach@ti.com> wrote:
>>>> Driver to read from a register and depending on either set bits or
>>>> a specific known selectively enable or disable OPPs based on DT node.
>>>>
>>>> Can support opp-modifier-reg-bit where single bits within the register
>>>> determine the availability of an OPP or opp-modifier-reg-val where a
>>>> certain value inside the register or a portion of it determine what the
>>>> maximum allowed OPP is.
>>>>
>>>> The driver expects a device that has already has its OPPs loaded
>>>> and then will disable the OPPs not matching the criteria specified in
>>>> the opp-modifier table.
>>>>
>>>> Signed-off-by: Dave Gerlach <d-gerlach@ti.com>
>>>> ---
>>>>  .../devicetree/bindings/power/opp-modifier.txt     | 111 +++++++++
>>>>  drivers/power/opp/Makefile                         |   1 +
>>>>  drivers/power/opp/opp-modifier-reg.c               | 259 +++++++++++++++++++++
>>>>  3 files changed, 371 insertions(+)
>>>>  create mode 100644 Documentation/devicetree/bindings/power/opp-modifier.txt
>>>>  create mode 100644 drivers/power/opp/opp-modifier-reg.c
>>>>
>>>> diff --git a/Documentation/devicetree/bindings/power/opp-modifier.txt b/Documentation/devicetree/bindings/power/opp-modifier.txt
>>>> new file mode 100644
>>>> index 0000000..af8a2e9
>>>> --- /dev/null
>>>> +++ b/Documentation/devicetree/bindings/power/opp-modifier.txt
>>>> @@ -0,0 +1,111 @@
>>>> +* OPP-Modifier - opp modifier to selectively enable operating points
>>>> +
>>>> +Many SoCs that have selectively modifiable OPPs can specify
>>>> +all available OPPs in their operating-points listing and then define
>>>> +opp_modifiers to enable or disable the OPPs that are actually available
>>>> +on the specific hardware.
>>>> +
>>>> +* OPP Modifier Provider
>>>
>>> Uggg. Please stop designing around the current OPP binding which has
>>> the problem that the OPP table is not extensible to add more data.
>>> Define a new OPP binding that solves these problems. This is at least
>> Generically, there are three different issues with current OPP bindings:
>> a) ability to enable disable certain OPPs depending on SoC OTP/Efuse
>> settings.
> 
> More generically: ...depending on variety of factors.

Agreed.

> 
>> b) ability to reuse OPPs defined for one device node for another (cpu1
>> to reuse OPP definitions of cpu0)
>> c) ability to add additional information per OPP. we can argue this is
>> a superset of (a), but really, the problems are different.
> 
> It is all additional data per OPP. Additional different information is
> of course for different problems. That doesn't mean we need different
> solutions.
> 
>> Previous proposals include making each OPP as a phandle, but there
>> does not seem much traction in that direction either. - proposal here
>> has nothing to do with (b) or (c).
> 
> They may have nothing to do with each other, but they all have to do
> with the OPP binding. If we're going to change/extend the binding,
> then all issues need to be taken into account.

We aren't extending the existing binding in this series. We are just
defining how hardware description of which OPPs are valid here.

>>> the 3rd OPP related binding addition I've seen recently. But I
>>> wouldn't spend a lot of effort on a new OPP binding just to add the
>>> functionality you are adding here because I don't like the whole
>>> concept in general. This might be a common way to determine valid OPPs
>>> on TI chips, but I think it is too low level and I don't want to see
>>
>> Not just TI chips, but iMX, now, Marvell, Xilinx as well. potentially
>> more as well. doing OTP/Efuse based decision on which OPPs are valid
>> on a chip is not a TI specific thing. This was the reason for us to
>> try to define something generic enough to be reused by more SoCs than
>> just TI.
> 
> Agreed, but I'm not convinced how different SOCs determine valid OPPs
> is common enough. Certainly how to mark an entry disabled is common
> though.

Fair enough, without procuring NDA documents for all the SoCs, I
cannot comment much either, all we can do is see threads such as
http://marc.info/?t=139470791100003&r=1&w=2 and propose.

This series does include iMx as well which seems to have equivalent
challenges.

I have given examples here on how the current driver at least tries to
make generic the instances of SoCs that we have today, further, the
driver in no way constraints us from using opp_modifier_register with
proper ops in case we do something weirdly different (example: use non
memory mapped operations) - it is just a simple framework.

> 
>>> bindings for every different possible way. Just add platform code to
>>> do the OPP setup you need.
>> Errr.. adding platform code means the hardware description goes back
>> to kernel. is'nt that giving up on device tree binding for describing
>> hardware?
> 
> We're always going to have some platform code. I'm not saying you have
> to in this case. I'm saying either come up with an OPP binding
> addressing all these issues or live with the existing one and fix it
> up in the kernel or bootloader.

bootloader is out of the picture considering most of the platforms
need to deal with legacy bootloaders.

then tying part of the data in kernel and part in dts!
> 
>>> Frankly, I prefer the bootloader/firmware fixup the OPP table approach
>>> mentioned in the cpufreq-cpu0 thread. Somewhat less desirable, but the
>>> kernel could do the fixups as well (via of_update_property).
>>
>> a) Trying to move the hardware definition away from device tree seems
>> to me a major step backwards.
>> b) Allowing for definitions in platform code is a step backwards again
>> for a generic solution that works for more than 1 vendor.
>> c) moving the logic away to bootloader when it can easily be done in
>> kernel again is adding burden to bootloader for data it does need to
>> handle.
> 
> The burden has to be somewhere. Maintaining a binding forever in the
> kernel is a burden as well if it is poorly designed.

> 
> Valid OPPs are not going to just be random. There's probably on a few
> combinations and they'll be based on part# or speed grade or something
> (which in turn defines the efuses in your case). While a dev board may
> have random parts on it, an actual product would not. I could argue
> that your DTB just needs to be correct to begin with for a given
> part/design. Obviously, managing minor differences in a DTB like this
> can be a pain. This is why firmware or bootloaders do adjustments to
> the DTB at runtime and it is quite common.
Bootloaders may not always be capable of doing things or may just be
legacy bootloader that were created in a world where kernel was self
sustaining (opp data was in the kernel previously in OMAP as an
example). asking bootloader to change to ensure dtbs are proper is
just opening up another can of worms here.

> 
>> OPP is a hardware behavior, which OPPs are enabled are described in
>> hardware on certain SoCs. the current proposal is to provide a generic
>> solution for those devices that allow for dynamic definition of OPPs
>> based on SoC efuse definition.
> 
> What if the decision is not based on a single register bit? Perhaps
> efuses are not directly memory mapped. Maybe it is based on Si
> revision. Or you need to limit frequency because a certain board can't
> supply adequate current. You call this generic, but it is not. It
> doesn't even solve the part that is generic which is marking some OPPs
> disabled.

Are we saying that having a generic layer which may decide on which
OPPs are valid and which are not is a no-no? the RFC has a few issues,
I agree, but that is part of our review process to help improve if we
think the over all concept is good enough to carry forward for next
patch iteration. You dont seem convinced enough to think that makes
sense here.

As I mentioned, patch #1 is the framework, patch #2 is a specific
implementation(and there are improvements possible)-> if we need to
add sil revision based logic OR have current supply based
implementation OR have non memory mapped based decision making,
there'd be specific drivers for them.

The key question is this: do we have an conceptual agreement that
making the decision on which OPPs are valid is a decision for the
kernel? if yes, lets make that as the standard, if kernel should not
do it, then we enforce discipline that bootloaders will mandatorily
implement dtb modification for OPP entries for all SoCs. If I
understand your thought, I think your push is for dtbs containing the
right OPP entries always.

If we agree that kernel should be standalone capable of handling valid
OPPs (which happens to be my view), then lets debate if a generic
layer such as the one proposed should be created helping all SoCs to
operate generically. So far, none of the arguments you have presented
seems to indicate such a generic layer is impossible to do.
Dave Gerlach March 25, 2014, 3:24 a.m. UTC | #5
On 03/18/2014 10:36 AM, Nishanth Menon wrote:
> On 03/17/2014 01:37 PM, Rob Herring wrote:
>> On Mon, Mar 17, 2014 at 9:30 AM, Nishanth Menon <nm@ti.com> wrote:
>>> On 03/14/2014 04:00 PM, Rob Herring wrote:
>>>> On Fri, Mar 14, 2014 at 2:25 PM, Dave Gerlach <d-gerlach@ti.com> wrote:
>>>>> Driver to read from a register and depending on either set bits or
>>>>> a specific known selectively enable or disable OPPs based on DT node.
>>>>>
>>>>> Can support opp-modifier-reg-bit where single bits within the register
>>>>> determine the availability of an OPP or opp-modifier-reg-val where a
>>>>> certain value inside the register or a portion of it determine what the
>>>>> maximum allowed OPP is.
>>>>>
>>>>> The driver expects a device that has already has its OPPs loaded
>>>>> and then will disable the OPPs not matching the criteria specified in
>>>>> the opp-modifier table.
>>>>>
>>>>> Signed-off-by: Dave Gerlach <d-gerlach@ti.com>
>>>>> ---
>>>>>   .../devicetree/bindings/power/opp-modifier.txt     | 111 +++++++++
>>>>>   drivers/power/opp/Makefile                         |   1 +
>>>>>   drivers/power/opp/opp-modifier-reg.c               | 259 +++++++++++++++++++++
>>>>>   3 files changed, 371 insertions(+)
>>>>>   create mode 100644 Documentation/devicetree/bindings/power/opp-modifier.txt
>>>>>   create mode 100644 drivers/power/opp/opp-modifier-reg.c
>>>>>
>>>>> diff --git a/Documentation/devicetree/bindings/power/opp-modifier.txt b/Documentation/devicetree/bindings/power/opp-modifier.txt
>>>>> new file mode 100644
>>>>> index 0000000..af8a2e9
>>>>> --- /dev/null
>>>>> +++ b/Documentation/devicetree/bindings/power/opp-modifier.txt
>>>>> @@ -0,0 +1,111 @@
>>>>> +* OPP-Modifier - opp modifier to selectively enable operating points
>>>>> +
>>>>> +Many SoCs that have selectively modifiable OPPs can specify
>>>>> +all available OPPs in their operating-points listing and then define
>>>>> +opp_modifiers to enable or disable the OPPs that are actually available
>>>>> +on the specific hardware.
>>>>> +
>>>>> +* OPP Modifier Provider
>>>>
>>>> Uggg. Please stop designing around the current OPP binding which has
>>>> the problem that the OPP table is not extensible to add more data.
>>>> Define a new OPP binding that solves these problems. This is at least
>>> Generically, there are three different issues with current OPP bindings:
>>> a) ability to enable disable certain OPPs depending on SoC OTP/Efuse
>>> settings.
>>
>> More generically: ...depending on variety of factors.
>
> Agreed.

The idea here was not to touch the existing OPP bindings, hence the opp 
"modifier" name. As Nishanth stated, this is not extending the binding; 
the opp-modifier just uses the frequency values as an identifier but the 
driver does not necessarily care where the OPPs already in the table 
came from, just that they correspond to the same frequencies it 
describes. And again, just to reiterate, nothing is binding the user to 
use the opp-modifier-reg child driver, any driver could be written to 
decide which OPPs to enable or disable.

With that said, I do understand that this is far from a perfect solution 
to the issue of defining which OPPs are available, I meant it as a 
suggestion for a possible way forward. This could be used as a starting 
point for something even more generic. It's a common problem on many 
SoCs even if it is defined in completely different ways so this 
framework or one like it could give a common point to branch out from.

If we don't want to move forward with a generic layer to handle OPP 
availability, what is the best option? Does anybody else have opinions 
on this? Regardless of what is decided if everyone can agree on a 
direction we can all move forward.

Regards,
Dave

>
>>
>>> b) ability to reuse OPPs defined for one device node for another (cpu1
>>> to reuse OPP definitions of cpu0)
>>> c) ability to add additional information per OPP. we can argue this is
>>> a superset of (a), but really, the problems are different.
>>
>> It is all additional data per OPP. Additional different information is
>> of course for different problems. That doesn't mean we need different
>> solutions.
>>
>>> Previous proposals include making each OPP as a phandle, but there
>>> does not seem much traction in that direction either. - proposal here
>>> has nothing to do with (b) or (c).
>>
>> They may have nothing to do with each other, but they all have to do
>> with the OPP binding. If we're going to change/extend the binding,
>> then all issues need to be taken into account.
>
> We aren't extending the existing binding in this series. We are just
> defining how hardware description of which OPPs are valid here.
>
>>>> the 3rd OPP related binding addition I've seen recently. But I
>>>> wouldn't spend a lot of effort on a new OPP binding just to add the
>>>> functionality you are adding here because I don't like the whole
>>>> concept in general. This might be a common way to determine valid OPPs
>>>> on TI chips, but I think it is too low level and I don't want to see
>>>
>>> Not just TI chips, but iMX, now, Marvell, Xilinx as well. potentially
>>> more as well. doing OTP/Efuse based decision on which OPPs are valid
>>> on a chip is not a TI specific thing. This was the reason for us to
>>> try to define something generic enough to be reused by more SoCs than
>>> just TI.
>>
>> Agreed, but I'm not convinced how different SOCs determine valid OPPs
>> is common enough. Certainly how to mark an entry disabled is common
>> though.
>
> Fair enough, without procuring NDA documents for all the SoCs, I
> cannot comment much either, all we can do is see threads such as
> http://marc.info/?t=139470791100003&r=1&w=2 and propose.
>
> This series does include iMx as well which seems to have equivalent
> challenges.
>
> I have given examples here on how the current driver at least tries to
> make generic the instances of SoCs that we have today, further, the
> driver in no way constraints us from using opp_modifier_register with
> proper ops in case we do something weirdly different (example: use non
> memory mapped operations) - it is just a simple framework.
>
>>
>>>> bindings for every different possible way. Just add platform code to
>>>> do the OPP setup you need.
>>> Errr.. adding platform code means the hardware description goes back
>>> to kernel. is'nt that giving up on device tree binding for describing
>>> hardware?
>>
>> We're always going to have some platform code. I'm not saying you have
>> to in this case. I'm saying either come up with an OPP binding
>> addressing all these issues or live with the existing one and fix it
>> up in the kernel or bootloader.
>
> bootloader is out of the picture considering most of the platforms
> need to deal with legacy bootloaders.
>
> then tying part of the data in kernel and part in dts!
>>
>>>> Frankly, I prefer the bootloader/firmware fixup the OPP table approach
>>>> mentioned in the cpufreq-cpu0 thread. Somewhat less desirable, but the
>>>> kernel could do the fixups as well (via of_update_property).
>>>
>>> a) Trying to move the hardware definition away from device tree seems
>>> to me a major step backwards.
>>> b) Allowing for definitions in platform code is a step backwards again
>>> for a generic solution that works for more than 1 vendor.
>>> c) moving the logic away to bootloader when it can easily be done in
>>> kernel again is adding burden to bootloader for data it does need to
>>> handle.
>>
>> The burden has to be somewhere. Maintaining a binding forever in the
>> kernel is a burden as well if it is poorly designed.
>
>>
>> Valid OPPs are not going to just be random. There's probably on a few
>> combinations and they'll be based on part# or speed grade or something
>> (which in turn defines the efuses in your case). While a dev board may
>> have random parts on it, an actual product would not. I could argue
>> that your DTB just needs to be correct to begin with for a given
>> part/design. Obviously, managing minor differences in a DTB like this
>> can be a pain. This is why firmware or bootloaders do adjustments to
>> the DTB at runtime and it is quite common.
> Bootloaders may not always be capable of doing things or may just be
> legacy bootloader that were created in a world where kernel was self
> sustaining (opp data was in the kernel previously in OMAP as an
> example). asking bootloader to change to ensure dtbs are proper is
> just opening up another can of worms here.
>
>>
>>> OPP is a hardware behavior, which OPPs are enabled are described in
>>> hardware on certain SoCs. the current proposal is to provide a generic
>>> solution for those devices that allow for dynamic definition of OPPs
>>> based on SoC efuse definition.
>>
>> What if the decision is not based on a single register bit? Perhaps
>> efuses are not directly memory mapped. Maybe it is based on Si
>> revision. Or you need to limit frequency because a certain board can't
>> supply adequate current. You call this generic, but it is not. It
>> doesn't even solve the part that is generic which is marking some OPPs
>> disabled.
>
> Are we saying that having a generic layer which may decide on which
> OPPs are valid and which are not is a no-no? the RFC has a few issues,
> I agree, but that is part of our review process to help improve if we
> think the over all concept is good enough to carry forward for next
> patch iteration. You dont seem convinced enough to think that makes
> sense here.
>
> As I mentioned, patch #1 is the framework, patch #2 is a specific
> implementation(and there are improvements possible)-> if we need to
> add sil revision based logic OR have current supply based
> implementation OR have non memory mapped based decision making,
> there'd be specific drivers for them.
>
> The key question is this: do we have an conceptual agreement that
> making the decision on which OPPs are valid is a decision for the
> kernel? if yes, lets make that as the standard, if kernel should not
> do it, then we enforce discipline that bootloaders will mandatorily
> implement dtb modification for OPP entries for all SoCs. If I
> understand your thought, I think your push is for dtbs containing the
> right OPP entries always.
>
> If we agree that kernel should be standalone capable of handling valid
> OPPs (which happens to be my view), then lets debate if a generic
> layer such as the one proposed should be created helping all SoCs to
> operate generically. So far, none of the arguments you have presented
> seems to indicate such a generic layer is impossible to do.
>

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
diff mbox

Patch

diff --git a/Documentation/devicetree/bindings/power/opp-modifier.txt b/Documentation/devicetree/bindings/power/opp-modifier.txt
new file mode 100644
index 0000000..af8a2e9
--- /dev/null
+++ b/Documentation/devicetree/bindings/power/opp-modifier.txt
@@ -0,0 +1,111 @@ 
+* OPP-Modifier - opp modifier to selectively enable operating points
+
+Many SoCs that have selectively modifiable OPPs can specify
+all available OPPs in their operating-points listing and then define
+opp_modifiers to enable or disable the OPPs that are actually available
+on the specific hardware.
+
+* OPP Modifier Provider
+
+For single bits that define the availability of an OPP:
+-------------------------------------------
+Some SoCs define a bit in a register that indicates whether an OPP is
+available. This will disable any OPP with a frequency corresponding to
+the bit given if it is not set appropriately.
+
+properties:
+- compatible : Should be "opp-modifier-reg-bit"
+- reg : Address and length of the registers needed to identify available
+	OPPs, here we provide just the register containing OPP data.
+
+Optional Properties:
+- opp,reg-bit-enable-low: Take the complement of register before comparing mask
+		     defined below under opp-modifier.
+
+Sub-nodes:
+Sub-nodes are defined as a container to hold opp modifier table for a
+specific device with an operating-points table already defined
+
+Sub-node properties:
+- opp-modifier: A collection of rows consisting of the following entries to
+		allow specification of available OPPs:
+	-kHz: The opp to be enabled based on following criteria
+	-offset: Offset into register where relevant bits are located
+	-value: Bit that indicates availability of OPP
+
+Example:
+
+	opp_modifier: opp_modifier@0x44e107fc {
+		compatible = "opp-modifier-reg-bit";
+		reg = <0x44e107fc 0x04>;
+
+		mpu_opp_modifier: mpu_opp_modifier {
+			opp-modifier = <
+			/* kHz   offset  value */
+			1000000  0	BIT_1
+			720000   0	BIT_2
+			>;
+		};
+	};
+
+For a value that defines the maximum available OPP:
+-------------------------------------------
+Some SoCs define a value in a register that corresponds to an OPP. If
+that value is matched this will disable all OPPs greater than the
+associated frequency.
+
+properties:
+- compatible : Should be "opp-modifier-reg-val"
+- reg : Address and length of the registers needed to identify available
+	OPPs, here we provide just the register containing OPP data.
+
+Optional Properties:
+- opp,reg-mask: Only compare the bits masked off by this value.
+
+Sub-nodes:
+Sub-nodes are defined as a container to hold opp modifier table for a
+specific device with an operating-points table already defined
+
+Sub-node properties:
+- opp-modifier: A collection of rows consisting of the following entries to
+		allow specification of available OPPs:
+	-kHz: The opp to be enabled based on following criteria
+	-offset: Offset into register where relevant bits are located
+	-value: Value that indicates maximum available OPP
+
+Example:
+
+	opp_modifier: opp_modifier@0x44e107fc {
+		compatible = "opp-modifier-reg-val";
+		reg = <0x44e107fc 0x04>;
+
+		mpu_opp_modifier: mpu_opp_modifier {
+			opp-modifier = <
+			/* kHz   offset  value */
+			1000000  0	VAL_1
+			720000   0	VAL_2
+			>;
+		};
+	};
+
+* OPP Modifier Consumer
+
+Properties:
+- platform-opp-modifier: phandle to the sub-node of the proper opp-modifier
+		provider that contains the appropriate opp-modifier table
+
+Example:
+
+cpu@0 {
+        compatible = "arm,cortex-a8";
+        device_type = "cpu";
+
+        operating-points = <
+                /* kHz    uV */
+                1000000 1351000
+                720000  1285000
+        >;
+
+        platform-opp-modifier = <&mpu_opp_modifier>;
+};
+
diff --git a/drivers/power/opp/Makefile b/drivers/power/opp/Makefile
index 820eb10..7f60adc 100644
--- a/drivers/power/opp/Makefile
+++ b/drivers/power/opp/Makefile
@@ -1 +1,2 @@ 
 obj-y += core.o
+obj-y += opp-modifier-reg.o
diff --git a/drivers/power/opp/opp-modifier-reg.c b/drivers/power/opp/opp-modifier-reg.c
new file mode 100644
index 0000000..f4dcf7a
--- /dev/null
+++ b/drivers/power/opp/opp-modifier-reg.c
@@ -0,0 +1,259 @@ 
+/*
+ * Single bit OPP Modifier Driver
+ *
+ * Copyright (C) 2013 Texas Instruments Incorporated - http://www.ti.com/
+ * Dave Gerlach <d-gerlach@ti.com>
+ *
+ * This program is free software; you can redistribute it and/or
+ * modify it under the terms of the GNU General Public License as
+ * published by the Free Software Foundation version 2.
+ *
+ * This program is distributed "as is" WITHOUT ANY WARRANTY of any
+ * kind, whether express or implied; without even the implied warranty
+ * of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+ * GNU General Public License for more details.
+ */
+
+#include <linux/init.h>
+#include <linux/module.h>
+#include <linux/io.h>
+#include <linux/slab.h>
+#include <linux/err.h>
+#include <linux/list.h>
+
+#include <linux/of.h>
+#include <linux/of_device.h>
+#include <linux/pm_opp.h>
+#include <linux/opp-modifier.h>
+
+static struct of_device_id opp_omap_of_match[];
+
+struct opp_reg_context {
+	struct device   *dev;
+	void __iomem    *reg;
+	u32	mask;
+	bool		enable_low;
+	int (*modify)(struct device *dev, const struct property *prop);
+};
+
+static struct opp_reg_context *opp_reg;
+
+static unsigned long opp_reg_read(int offset)
+{
+	return readl(opp_reg->reg + offset);
+}
+
+static int opp_modifier_reg_bit_enable(struct device *dev,
+				       const struct property *prop)
+{
+	const __be32 *val;
+	unsigned long reg_val, freq, offset, bit;
+	int idx;
+
+	val = prop->value;
+	idx = (prop->length / sizeof(u32)) / 3;
+	while (idx--) {
+		freq = be32_to_cpup(val++) * 1000;
+		offset = be32_to_cpup(val++);
+		bit = be32_to_cpup(val++);
+
+		reg_val = opp_reg_read(offset);
+
+		if (opp_reg->enable_low)
+			reg_val = ~reg_val;
+
+		if (!(reg_val & bit))
+			dev_pm_opp_disable(dev, freq);
+	}
+	return 0;
+}
+
+static int opp_modifier_reg_value_enable(struct device *dev,
+					 const struct property *prop)
+{
+	const __be32 *val;
+	unsigned long reg_val, freq, offset, bits;
+	unsigned long disable_freq, search_freq;
+	struct dev_pm_opp *disable_opp;
+	int idx, i, opp_count;
+
+	val = prop->value;
+	idx = (prop->length / sizeof(u32)) / 3;
+
+	while (idx--) {
+		freq = be32_to_cpup(val++) * 1000;
+		offset = be32_to_cpup(val++);
+		bits = be32_to_cpup(val++);
+
+		reg_val = opp_reg_read(offset);
+
+		if ((reg_val & opp_reg->mask) == bits) {
+			/*
+			 * Find all frequencies greater than current freq
+			 */
+			search_freq = freq + 1;
+			rcu_read_lock();
+			opp_count = dev_pm_opp_get_opp_count(dev);
+			rcu_read_unlock();
+
+			for (i = 0; i < opp_count; i++) {
+				rcu_read_lock();
+				disable_opp =
+					dev_pm_opp_find_freq_ceil(dev,
+								  &search_freq);
+				if (IS_ERR(disable_opp)) {
+					rcu_read_unlock();
+					break;
+				}
+				disable_freq =
+					dev_pm_opp_get_freq(disable_opp);
+				rcu_read_unlock();
+				dev_pm_opp_disable(dev, disable_freq);
+			}
+		}
+	}
+
+	return 0;
+}
+
+static int of_opp_check_availability(struct device *dev, struct device_node *np)
+{
+	const struct property *prop;
+	int nr;
+
+	if (!dev || !np)
+		return -EINVAL;
+
+	prop = of_find_property(np, "opp-modifier", NULL);
+	if (!prop)
+		return -EINVAL;
+	if (!prop->value)
+		return -EINVAL;
+
+	nr = prop->length / sizeof(u32);
+	if (nr % 3) {
+		pr_err("%s: Invalid OPP Available list\n", __func__);
+		return -EINVAL;
+	}
+
+	return opp_reg->modify(dev, prop);
+}
+
+static int opp_modifier_reg_device_modify(struct device *dev)
+{
+	struct device_node *np;
+	int ret;
+
+	if (!dev)
+		return -EINVAL;
+
+	np = of_parse_phandle(dev->of_node, "platform-opp-modifier", 0);
+
+	if (!np)
+		return -EINVAL;
+
+	ret = of_opp_check_availability(dev, np);
+
+	if (ret)
+		pr_err("Error modifying available OPPs\n");
+
+	of_node_put(np);
+
+	return ret;
+}
+
+static struct opp_modifier_ops opp_modifier_reg_ops = {
+	.modify = opp_modifier_reg_device_modify,
+};
+
+static struct opp_modifier_dev opp_modifier_reg_dev = {
+	.ops = &opp_modifier_reg_ops,
+};
+
+static struct of_device_id opp_modifier_reg_of_match[] = {
+	{
+		.compatible = "opp-modifier-reg-bit",
+		.data = &opp_modifier_reg_bit_enable,
+	},
+	{
+		.compatible = "opp-modifier-reg-val",
+		.data = &opp_modifier_reg_value_enable,
+	},
+	{ },
+};
+MODULE_DEVICE_TABLE(of, opp_modifier_reg_of_match);
+
+static int opp_modifier_reg_probe(struct platform_device *pdev)
+{
+	const struct of_device_id *match;
+	struct resource *res;
+	struct device_node *np = pdev->dev.of_node;
+	int ret = 0;
+
+	opp_reg = devm_kzalloc(&pdev->dev, sizeof(*opp_reg), GFP_KERNEL);
+	if (!opp_reg) {
+		dev_err(opp_reg->dev, "reg context memory allocation failed\n");
+		ret = -ENOMEM;
+		goto err;
+	}
+
+	match = of_match_device(opp_modifier_reg_of_match, &pdev->dev);
+
+	if (!match) {
+		dev_err(&pdev->dev, "Invalid match data value\n");
+		ret = -EINVAL;
+		goto err;
+	}
+
+	opp_reg->modify = (void *)match->data;
+
+	opp_reg->dev = &pdev->dev;
+
+	res = platform_get_resource(pdev, IORESOURCE_MEM, 0);
+	if (!res) {
+		dev_err(&pdev->dev, "no memory resource for opp register\n");
+		ret = -ENXIO;
+		goto err;
+	}
+
+	opp_reg->reg = devm_request_and_ioremap(opp_reg->dev, res);
+	if (!opp_reg->reg) {
+		dev_err(opp_reg->dev, "could not ioremap opp register\n");
+		ret = -EADDRNOTAVAIL;
+		goto err;
+	}
+
+	if (of_get_property(np, "opp,reg-bit-enable-low", NULL))
+		opp_reg->enable_low = true;
+
+	of_property_read_u32(np, "opp,reg-mask", &opp_reg->mask);
+
+	opp_modifier_reg_dev.ops = &opp_modifier_reg_ops;
+	opp_modifier_reg_dev.of_node = pdev->dev.of_node;
+
+	opp_modifier_register(&opp_modifier_reg_dev);
+
+err:
+	return ret;
+}
+
+static int opp_modifier_reg_remove(struct platform_device *pdev)
+{
+	return 0;
+}
+
+static struct platform_driver opp_modifier_reg_driver = {
+	.probe		= opp_modifier_reg_probe,
+	.remove		= opp_modifier_reg_remove,
+	.driver = {
+		.owner		= THIS_MODULE,
+		.name		= "opp-modifier-reg",
+		.of_match_table	= opp_modifier_reg_of_match,
+	},
+};
+
+module_platform_driver(opp_modifier_reg_driver);
+
+MODULE_AUTHOR("Dave Gerlach <d-gerlach@ti.com>");
+MODULE_DESCRIPTION("OPP Modifier driver for eFuse defined OPPs");
+MODULE_LICENSE("GPL v2");