re-set rtc date on reset handler

Message ID	1251822154-5423-1-git-send-email-glommer@redhat.com
State	Superseded
Headers	show Return-Path: <qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org> From: Glauber Costa <glommer@redhat.com> To: qemu-devel@nongnu.org Date: Tue, 1 Sep 2009 12:22:34 -0400 Message-Id: <1251822154-5423-1-git-send-email-glommer@redhat.com> Cc: aliguori@us.ibm.com Subject: [Qemu-devel] [PATCH] re-set rtc date on reset handler Precedence: list Sender: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org Errors-To: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org

Glauber Costa Sept. 1, 2009, 4:22 p.m. UTC

guests without a stable timesource such as kvm-clock will grab the wallclock
from our rtc chip. However, we only sync the date when we first launch qemu.
If a guest goes through a series of reboot cycles, it will slowly see time
getting far behind the host.

The proposal of this patch is to set the date to host clock again in the reset
handler. With this patch, I see a Fedora guest keeping its clock in sync upon
an ulimited number of reboots.

Signed-off-by: Glauber Costa <glommer@redhat.com>
---
 hw/mc146818rtc.c |    2 ++
 1 files changed, 2 insertions(+), 0 deletions(-)

Blue Swirl Sept. 1, 2009, 4:56 p.m. UTC | #1

On Tue, Sep 1, 2009 at 7:22 PM, Glauber Costa<glommer@redhat.com> wrote:
> guests without a stable timesource such as kvm-clock will grab the wallclock
> from our rtc chip. However, we only sync the date when we first launch qemu.
> If a guest goes through a series of reboot cycles, it will slowly see time
> getting far behind the host.
>
> The proposal of this patch is to set the date to host clock again in the reset
> handler. With this patch, I see a Fedora guest keeping its clock in sync upon
> an ulimited number of reboots.

A different approach is used in m48t59.c: the guest clock is generated
directly from host clock without any timers and only a fixed offset
(to account for time when guest was stopped) is added, so the clock
will always in sync.

Glauber Costa Sept. 1, 2009, 11 p.m. UTC | #2

On Tue, Sep 1, 2009 at 1:56 PM, Blue Swirl<blauwirbel@gmail.com> wrote:
> On Tue, Sep 1, 2009 at 7:22 PM, Glauber Costa<glommer@redhat.com> wrote:
>> guests without a stable timesource such as kvm-clock will grab the wallclock
>> from our rtc chip. However, we only sync the date when we first launch qemu.
>> If a guest goes through a series of reboot cycles, it will slowly see time
>> getting far behind the host.
>>
>> The proposal of this patch is to set the date to host clock again in the reset
>> handler. With this patch, I see a Fedora guest keeping its clock in sync upon
>> an ulimited number of reboots.
>
> A different approach is used in m48t59.c: the guest clock is generated
> directly from host clock without any timers and only a fixed offset
You mean at every ioport read? I actually thought about that, but was
too lazy...

Dor Laor Sept. 8, 2009, 2:55 p.m. UTC | #3

On 09/02/2009 02:00 AM, Glauber Costa wrote:
> On Tue, Sep 1, 2009 at 1:56 PM, Blue Swirl<blauwirbel@gmail.com>  wrote:
>> On Tue, Sep 1, 2009 at 7:22 PM, Glauber Costa<glommer@redhat.com>  wrote:
>>> guests without a stable timesource such as kvm-clock will grab the wallclock
>>> from our rtc chip. However, we only sync the date when we first launch qemu.
>>> If a guest goes through a series of reboot cycles, it will slowly see time
>>> getting far behind the host.
>>>
>>> The proposal of this patch is to set the date to host clock again in the reset
>>> handler. With this patch, I see a Fedora guest keeping its clock in sync upon
>>> an ulimited number of reboots.
>>
>> A different approach is used in m48t59.c: the guest clock is generated
>> directly from host clock without any timers and only a fixed offset
> You mean at every ioport read? I actually thought about that, but was
> too lazy...
>
>

You also need to handle the -startdate flag of qemu.
In this case you need to keep the offset and not read the host time.

Jan Kiszka Sept. 8, 2009, 4:15 p.m. UTC | #4

Blue Swirl wrote:
> On Tue, Sep 1, 2009 at 7:22 PM, Glauber Costa<glommer@redhat.com> wrote:
>> guests without a stable timesource such as kvm-clock will grab the wallclock
>> from our rtc chip. However, we only sync the date when we first launch qemu.
>> If a guest goes through a series of reboot cycles, it will slowly see time
>> getting far behind the host.
>>
>> The proposal of this patch is to set the date to host clock again in the reset
>> handler. With this patch, I see a Fedora guest keeping its clock in sync upon
>> an ulimited number of reboots.
> 
> A different approach is used in m48t59.c: the guest clock is generated
> directly from host clock without any timers and only a fixed offset
> (to account for time when guest was stopped) is added, so the clock
> will always in sync.

Ah, that looks like a useful approach! We currently face the issue of
vRTC drifting away from the host time (as the latter is tuned by NTP).

Do you or anyone else know if switching the PC RTC over to the scheme
used in the m48t59 may have some downsides? If not, I would happily hack
up a patch ASAP and suggest it for mainline.

Jan

Jan Kiszka Sept. 8, 2009, 4:50 p.m. UTC | #5

Jan Kiszka wrote:
> Blue Swirl wrote:
>> On Tue, Sep 1, 2009 at 7:22 PM, Glauber Costa<glommer@redhat.com> wrote:
>>> guests without a stable timesource such as kvm-clock will grab the wallclock
>>> from our rtc chip. However, we only sync the date when we first launch qemu.
>>> If a guest goes through a series of reboot cycles, it will slowly see time
>>> getting far behind the host.
>>>
>>> The proposal of this patch is to set the date to host clock again in the reset
>>> handler. With this patch, I see a Fedora guest keeping its clock in sync upon
>>> an ulimited number of reboots.
>> A different approach is used in m48t59.c: the guest clock is generated
>> directly from host clock without any timers and only a fixed offset
>> (to account for time when guest was stopped) is added, so the clock
>> will always in sync.
> 
> Ah, that looks like a useful approach! We currently face the issue of
> vRTC drifting away from the host time (as the latter is tuned by NTP).
> 
> Do you or anyone else know if switching the PC RTC over to the scheme
> used in the m48t59 may have some downsides? If not, I would happily hack
> up a patch ASAP and suggest it for mainline.

Clearly, one downside is that a jumping host time will cause the RTC to
jump as well. However, there might by setups where this does not happen,
so optionally switching the RTC over rt_clock seems like a reasonable
feature to me.

What about consolidating -localtime, -starttime and -rtc-td-hack at this
chance? Something like

 -rtc base=utc|localtime|date,clock=vm|rt

with 'date' specifying the base as in -starttime. 'clock' would then
allow to drive the RTC via the host's CLOCK_REALTIME (with all its pros
and cons).

Jan

Jan Kiszka Sept. 8, 2009, 5 p.m. UTC | #6

Jan Kiszka wrote:
> Jan Kiszka wrote:
>> Blue Swirl wrote:
>>> On Tue, Sep 1, 2009 at 7:22 PM, Glauber Costa<glommer@redhat.com> wrote:
>>>> guests without a stable timesource such as kvm-clock will grab the wallclock
>>>> from our rtc chip. However, we only sync the date when we first launch qemu.
>>>> If a guest goes through a series of reboot cycles, it will slowly see time
>>>> getting far behind the host.
>>>>
>>>> The proposal of this patch is to set the date to host clock again in the reset
>>>> handler. With this patch, I see a Fedora guest keeping its clock in sync upon
>>>> an ulimited number of reboots.
>>> A different approach is used in m48t59.c: the guest clock is generated
>>> directly from host clock without any timers and only a fixed offset
>>> (to account for time when guest was stopped) is added, so the clock
>>> will always in sync.
>> Ah, that looks like a useful approach! We currently face the issue of
>> vRTC drifting away from the host time (as the latter is tuned by NTP).
>>
>> Do you or anyone else know if switching the PC RTC over to the scheme
>> used in the m48t59 may have some downsides? If not, I would happily hack
>> up a patch ASAP and suggest it for mainline.
> 
> Clearly, one downside is that a jumping host time will cause the RTC to
> jump as well. However, there might by setups where this does not happen,
> so optionally switching the RTC over rt_clock seems like a reasonable
> feature to me.
> 
> What about consolidating -localtime, -starttime and -rtc-td-hack at this
> chance? Something like
> 
>  -rtc base=utc|localtime|date,clock=vm|rt

+ ,td-hack=on|off

of course. Or let's call it "drift-hack".

> 
> with 'date' specifying the base as in -starttime. 'clock' would then
> allow to drive the RTC via the host's CLOCK_REALTIME (with all its pros
> and cons).
> 
> Jan
> 

Jan

Dor Laor Sept. 9, 2009, 12:28 p.m. UTC | #7

On 09/08/2009 08:00 PM, Jan Kiszka wrote:
> Jan Kiszka wrote:
>> Jan Kiszka wrote:
>>> Blue Swirl wrote:
>>>> On Tue, Sep 1, 2009 at 7:22 PM, Glauber Costa<glommer@redhat.com>  wrote:
>>>>> guests without a stable timesource such as kvm-clock will grab the wallclock
>>>>> from our rtc chip. However, we only sync the date when we first launch qemu.
>>>>> If a guest goes through a series of reboot cycles, it will slowly see time
>>>>> getting far behind the host.
>>>>>
>>>>> The proposal of this patch is to set the date to host clock again in the reset
>>>>> handler. With this patch, I see a Fedora guest keeping its clock in sync upon
>>>>> an ulimited number of reboots.
>>>> A different approach is used in m48t59.c: the guest clock is generated
>>>> directly from host clock without any timers and only a fixed offset
>>>> (to account for time when guest was stopped) is added, so the clock
>>>> will always in sync.
>>> Ah, that looks like a useful approach! We currently face the issue of
>>> vRTC drifting away from the host time (as the latter is tuned by NTP).
>>>
>>> Do you or anyone else know if switching the PC RTC over to the scheme
>>> used in the m48t59 may have some downsides? If not, I would happily hack
>>> up a patch ASAP and suggest it for mainline.
>>
>> Clearly, one downside is that a jumping host time will cause the RTC to
>> jump as well. However, there might by setups where this does not happen,
>> so optionally switching the RTC over rt_clock seems like a reasonable
>> feature to me.
>>
>> What about consolidating -localtime, -starttime and -rtc-td-hack at this
>> chance? Something like
>>
>>   -rtc base=utc|localtime|date,clock=vm|rt
>
> + ,td-hack=on|off
>
> of course. Or let's call it "drift-hack".

btw: this is not a hack, its virtualization support for rtc.
It should be the default when qemu runs along with kvm.

>
>>
>> with 'date' specifying the base as in -starttime. 'clock' would then
>> allow to drive the RTC via the host's CLOCK_REALTIME (with all its pros
>> and cons).
>>
>> Jan
>>
>
> Jan
>

Jan Kiszka Sept. 9, 2009, 12:57 p.m. UTC | #8

Dor Laor wrote:
> On 09/08/2009 08:00 PM, Jan Kiszka wrote:
>> Jan Kiszka wrote:
>>> Jan Kiszka wrote:
>>>> Blue Swirl wrote:
>>>>> On Tue, Sep 1, 2009 at 7:22 PM, Glauber Costa<glommer@redhat.com>  wrote:
>>>>>> guests without a stable timesource such as kvm-clock will grab the wallclock
>>>>>> from our rtc chip. However, we only sync the date when we first launch qemu.
>>>>>> If a guest goes through a series of reboot cycles, it will slowly see time
>>>>>> getting far behind the host.
>>>>>>
>>>>>> The proposal of this patch is to set the date to host clock again in the reset
>>>>>> handler. With this patch, I see a Fedora guest keeping its clock in sync upon
>>>>>> an ulimited number of reboots.
>>>>> A different approach is used in m48t59.c: the guest clock is generated
>>>>> directly from host clock without any timers and only a fixed offset
>>>>> (to account for time when guest was stopped) is added, so the clock
>>>>> will always in sync.
>>>> Ah, that looks like a useful approach! We currently face the issue of
>>>> vRTC drifting away from the host time (as the latter is tuned by NTP).
>>>>
>>>> Do you or anyone else know if switching the PC RTC over to the scheme
>>>> used in the m48t59 may have some downsides? If not, I would happily hack
>>>> up a patch ASAP and suggest it for mainline.
>>> Clearly, one downside is that a jumping host time will cause the RTC to
>>> jump as well. However, there might by setups where this does not happen,
>>> so optionally switching the RTC over rt_clock seems like a reasonable
>>> feature to me.
>>>
>>> What about consolidating -localtime, -starttime and -rtc-td-hack at this
>>> chance? Something like
>>>
>>>   -rtc base=utc|localtime|date,clock=vm|rt
>> + ,td-hack=on|off
>>
>> of course. Or let's call it "drift-hack".
> 
> btw: this is not a hack, its virtualization support for rtc.
> It should be the default when qemu runs along with kvm.

OK, will call it 'drift-fix'. But enabling it by default in kvm mode
should be filed as a separate patch on top of my queue (which will
appear soon).

Jan

Avi Kivity Sept. 9, 2009, 1:27 p.m. UTC | #9

On 09/09/2009 03:57 PM, Jan Kiszka wrote:
>
>> btw: this is not a hack, its virtualization support for rtc.
>> It should be the default when qemu runs along with kvm.
>>      
> OK, will call it 'drift-fix'. But enabling it by default in kvm mode
> should be filed as a separate patch on top of my queue (which will
> appear soon).
>    

drift-compensation?

Jan Kiszka Sept. 9, 2009, 1:31 p.m. UTC | #10

Avi Kivity wrote:
> On 09/09/2009 03:57 PM, Jan Kiszka wrote:
>>> btw: this is not a hack, its virtualization support for rtc.
>>> It should be the default when qemu runs along with kvm.
>>>      
>> OK, will call it 'drift-fix'. But enabling it by default in kvm mode
>> should be filed as a separate patch on top of my queue (which will
>> appear soon).
>>    
> 
> drift-compensation?
> 

'drift-compensation-for-windows-guests' - this is at least what the
documentation suggests right now. I was looking for a short tag (but not
as short as 'td'...).

Jan

Avi Kivity Sept. 9, 2009, 3:04 p.m. UTC | #11

On 09/09/2009 04:31 PM, Jan Kiszka wrote:
> 'drift-compensation-for-windows-guests' - this is at least what the
> documentation suggests right now. I was looking for a short tag (but not
> as short as 'td'...).
>
>    

It would work for any guest that uses rtc as its main clock source.

Anthony Liguori Sept. 9, 2009, 3:34 p.m. UTC | #12

Avi Kivity wrote:
> On 09/09/2009 04:31 PM, Jan Kiszka wrote:
>> 'drift-compensation-for-windows-guests' - this is at least what the
>> documentation suggests right now. I was looking for a short tag (but not
>> as short as 'td'...).
>>
>>    
>
> It would work for any guest that uses rtc as its main clock source.

First step is qdev conversion for the RTC.  You can then introduce a 
drift property.  I'd suggest something like drift=none|catchup|gradual.  
The default can be none.  We can also introduce a kvm machine type where 
the default is catchup.  Obviously, we don't support gradual today.

You can modify the default rtc settings using the -set operation (we can 
give the rtc a default id).  This lets localtime, td-hack, etc. all be 
aliases for -set.

Gleb Natapov Sept. 9, 2009, 4:09 p.m. UTC | #13

On Wed, Sep 09, 2009 at 10:34:54AM -0500, Anthony Liguori wrote:
> Avi Kivity wrote:
> >On 09/09/2009 04:31 PM, Jan Kiszka wrote:
> >>'drift-compensation-for-windows-guests' - this is at least what the
> >>documentation suggests right now. I was looking for a short tag (but not
> >>as short as 'td'...).
> >>
> >
> >It would work for any guest that uses rtc as its main clock source.
> 
> First step is qdev conversion for the RTC.  You can then introduce a
> drift property.  I'd suggest something like
> drift=none|catchup|gradual.  The default can be none.  We can also
> introduce a kvm machine type where the default is catchup.
> Obviously, we don't support gradual today.
> 
What is "catchup" and what is "gradual"?

--
			Gleb.

Anthony Liguori Sept. 9, 2009, 5:52 p.m. UTC | #14

Gleb Natapov wrote:
> What is "catchup" and what is "gradual"?
>   

Just placeholders to point out that there are different algorithms to 
replaying interrupts to a guest.  I called what we implemented "catchup" 
since we pretty much immediately send all of the missed ticks to the 
guest in order to catch-up.

I'd imagine that a 'gradual' algorithm would shorten the tick interval 
until you've caught up.  The Hyper-V paravirtualization manual has 
provisions for the guest to request what type of algorithm is used so it 
makes sense to accommodate this in our syntax.

Avi Kivity Sept. 9, 2009, 6:25 p.m. UTC | #15

On 09/09/2009 06:34 PM, Anthony Liguori wrote:
> First step is qdev conversion for the RTC.  You can then introduce a 
> drift property.  I'd suggest something like 
> drift=none|catchup|gradual.  The default can be none.  We can also 
> introduce a kvm machine type where the default is catchup.  Obviously, 
> we don't support gradual today.
>

Maybe none|step|slew to avoid alienating mayo lovers.

Should be a global option that applies to all clock sources?

Anthony Liguori Sept. 9, 2009, 6:33 p.m. UTC | #16

Avi Kivity wrote:
> On 09/09/2009 06:34 PM, Anthony Liguori wrote:
>> First step is qdev conversion for the RTC.  You can then introduce a 
>> drift property.  I'd suggest something like 
>> drift=none|catchup|gradual.  The default can be none.  We can also 
>> introduce a kvm machine type where the default is catchup.  
>> Obviously, we don't support gradual today.
>>
>
> Maybe none|step|slew to avoid alienating mayo lovers.
>
> Should be a global option that applies to all clock sources?

Should be an option for all clock devices.  And I think it makes sense 
to have a virtualization machine type that is automatically enabled when 
using kvm and that automatically enables drift=step.

Gleb Natapov Sept. 9, 2009, 6:40 p.m. UTC | #17

On Wed, Sep 09, 2009 at 12:52:12PM -0500, Anthony Liguori wrote:
> Gleb Natapov wrote:
> >What is "catchup" and what is "gradual"?
> 
> Just placeholders to point out that there are different algorithms
> to replaying interrupts to a guest.  I called what we implemented
> "catchup" since we pretty much immediately send all of the missed
> ticks to the guest in order to catch-up.
> 
> I'd imagine that a 'gradual' algorithm would shorten the tick
> interval until you've caught up.  The Hyper-V paravirtualization
> manual has provisions for the guest to request what type of
> algorithm is used so it makes sense to accommodate this in our
> syntax.
> 
We implement gradual then.

--
			Gleb.

Anthony Liguori Sept. 9, 2009, 6:50 p.m. UTC | #18

Gleb Natapov wrote:
> On Wed, Sep 09, 2009 at 12:52:12PM -0500, Anthony Liguori wrote:
>   
>> Gleb Natapov wrote:
>>     
>>> What is "catchup" and what is "gradual"?
>>>       
>> Just placeholders to point out that there are different algorithms
>> to replaying interrupts to a guest.  I called what we implemented
>> "catchup" since we pretty much immediately send all of the missed
>> ticks to the guest in order to catch-up.
>>
>> I'd imagine that a 'gradual' algorithm would shorten the tick
>> interval until you've caught up.  The Hyper-V paravirtualization
>> manual has provisions for the guest to request what type of
>> algorithm is used so it makes sense to accommodate this in our
>> syntax.
>>
>>     
> We implement gradual then.
>   

You're right.

Regards,

Anthony Liguori

re-set rtc date on reset handler

Commit Message

Comments

Patch