
[v11,47/59] i386/xen: handle PV timer hypercalls

Message ID 20230216062444.2129371-48-dwmw2@infradead.org
State New
Series Xen HVM support under KVM

Commit Message

David Woodhouse Feb. 16, 2023, 6:24 a.m. UTC
From: Joao Martins <joao.m.martins@oracle.com>

Introduce support for the one-shot and periodic modes of Xen PV timers,
whereby timer interrupts arrive through a special virq event channel
with deadlines set through:

1) the set_timer_op hypercall (one-shot only)
2) the vcpu_op {set,stop}_{singleshot,periodic}_timer sub-operations

Signed-off-by: Joao Martins <joao.m.martins@oracle.com>
Signed-off-by: David Woodhouse <dwmw@amazon.co.uk>
---
 hw/i386/kvm/xen_evtchn.c  |  31 +++++
 hw/i386/kvm/xen_evtchn.h  |   2 +
 target/i386/cpu.h         |   5 +
 target/i386/kvm/xen-emu.c | 252 +++++++++++++++++++++++++++++++++++++-
 target/i386/machine.c     |   1 +
 5 files changed, 289 insertions(+), 2 deletions(-)
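
For orientation, the sketch below shows the guest-visible interface being
emulated. It is illustrative only: the hypercall wrappers are hypothetical
stand-ins, while the struct layouts and VCPUOP_* values follow Xen's public
vcpu.h (and match the 8- and 16-byte build asserts in the patch below).

#include <stdint.h>

/* Struct layouts from Xen's public vcpu.h, reproduced for illustration. */
struct vcpu_set_singleshot_timer {
    uint64_t timeout_abs_ns;    /* absolute deadline, Xen system time in ns */
    uint32_t flags;             /* VCPU_SSHOTTMR_future etc. */
};

struct vcpu_set_periodic_timer {
    uint64_t period_ns;         /* interval between timer virq deliveries */
};

#define VCPUOP_set_periodic_timer    6   /* values as in vcpu.h */
#define VCPUOP_stop_periodic_timer   7
#define VCPUOP_set_singleshot_timer  8
#define VCPUOP_stop_singleshot_timer 9
#define VCPU_SSHOTTMR_future         (1 << 0)

/* Hypothetical stand-ins for the guest's real hypercall wrappers. */
static int HYPERVISOR_set_timer_op(uint64_t t) { (void)t; return 0; }
static int HYPERVISOR_vcpu_op(int cmd, int vcpu, void *arg)
{ (void)cmd; (void)vcpu; (void)arg; return 0; }

int main(void)
{
    /* One-shot deadline 1s into system time; set_timer_op does only this. */
    HYPERVISOR_set_timer_op(1000000000ULL);

    /* The same deadline via vcpu_op, failing instead if already past. */
    struct vcpu_set_singleshot_timer sst = {
        .timeout_abs_ns = 1000000000ULL,
        .flags = VCPU_SSHOTTMR_future,
    };
    HYPERVISOR_vcpu_op(VCPUOP_set_singleshot_timer, 0, &sst);

    /* A 10ms periodic timer on vCPU 0. */
    struct vcpu_set_periodic_timer spt = { .period_ns = 10000000ULL };
    HYPERVISOR_vcpu_op(VCPUOP_set_periodic_timer, 0, &spt);
    return 0;
}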

Comments

Durrant, Paul Feb. 20, 2023, 2:29 p.m. UTC | #1
On 16/02/2023 06:24, David Woodhouse wrote:
> From: Joao Martins <joao.m.martins@oracle.com>
> 
> Introduce support for the one-shot and periodic modes of Xen PV timers,
> whereby timer interrupts arrive through a special virq event channel
> with deadlines set through:
> 
> 1) the set_timer_op hypercall (one-shot only)
> 2) the vcpu_op {set,stop}_{singleshot,periodic}_timer sub-operations
> 
> Signed-off-by: Joao Martins <joao.m.martins@oracle.com>
> Signed-off-by: David Woodhouse <dwmw@amazon.co.uk>
> ---
>   hw/i386/kvm/xen_evtchn.c  |  31 +++++
>   hw/i386/kvm/xen_evtchn.h  |   2 +
>   target/i386/cpu.h         |   5 +
>   target/i386/kvm/xen-emu.c | 252 +++++++++++++++++++++++++++++++++++++-
>   target/i386/machine.c     |   1 +
>   5 files changed, 289 insertions(+), 2 deletions(-)
> 
[snip]
>   static bool kvm_xen_hcall_vcpu_op(struct kvm_xen_exit *exit, X86CPU *cpu,
>                                     int cmd, int vcpu_id, uint64_t arg)
>   {
> -    CPUState *dest = qemu_get_cpu(vcpu_id);
>       CPUState *cs = CPU(cpu);
> +    CPUState *dest = cs->cpu_index == vcpu_id ? cs : qemu_get_cpu(vcpu_id);
>       int err;
>   
> +    if (!dest) {
> +        return -ENOENT;
> +    }
> +

I thought the patch format was catching me out somehow but I don't think 
so...

The function declaration says 'static bool kvm_xen_hcall_vcpu_op(...)' 
but that return value doesn't look very boolean to me. I think you also 
have the same issue...

>       switch (cmd) {
>       case VCPUOP_register_runstate_memory_area:
>           err = vcpuop_register_runstate_info(cs, dest, arg);
> @@ -892,6 +1092,26 @@ static bool kvm_xen_hcall_vcpu_op(struct kvm_xen_exit *exit, X86CPU *cpu,
>       case VCPUOP_register_vcpu_info:
>           err = vcpuop_register_vcpu_info(cs, dest, arg);
>           break;
> +    case VCPUOP_set_singleshot_timer: {
> +        if (cs->cpu_index != vcpu_id) {
> +            return -EINVAL;
> +        }
> +        err = vcpuop_set_singleshot_timer(dest, arg);
> +        break;
> +    }
> +    case VCPUOP_stop_singleshot_timer:
> +        if (cs->cpu_index != vcpu_id) {
> +            return -EINVAL;
> +        }
> +        err = vcpuop_stop_singleshot_timer(dest);
> +        break;
> +    case VCPUOP_set_periodic_timer: {
> +        err = vcpuop_set_periodic_timer(cs, dest, arg);
> +        break;
> +    }
> +    case VCPUOP_stop_periodic_timer:
> +        err = vcpuop_stop_periodic_timer(dest);
> +        break;
>   
>       default:
>           return false;
> @@ -1246,6 +1466,16 @@ static bool do_kvm_xen_handle_exit(X86CPU *cpu, struct kvm_xen_exit *exit)
>       }
>   
>       switch (code) {
> +    case __HYPERVISOR_set_timer_op:
> +        if (exit->u.hcall.longmode) {
> +            return kvm_xen_hcall_set_timer_op(exit, cpu,
> +                                              exit->u.hcall.params[0]);
> +        } else {
> +            /* In 32-bit mode, the 64-bit timer value is in two args. */
> +            uint64_t val = ((uint64_t)exit->u.hcall.params[1]) << 32 |
> +                (uint32_t)exit->u.hcall.params[0];
> +            return kvm_xen_hcall_set_timer_op(exit, cpu, val);
> +        }

... with these returns above.

   Paul

>       case __HYPERVISOR_grant_table_op:
>           return kvm_xen_hcall_gnttab_op(exit, cpu, exit->u.hcall.params[0],
>                                          exit->u.hcall.params[1],
> @@ -1355,7 +1585,25 @@ int kvm_put_xen_state(CPUState *cs)
>           }
>       }
>   
> +    if (env->xen_periodic_timer_period) {
> +        ret = do_set_periodic_timer(cs, env->xen_periodic_timer_period);
> +        if (ret < 0) {
> +            return ret;
> +        }
> +    }
> +
>       if (!kvm_xen_has_cap(EVTCHN_SEND)) {
> +        /*
> +         * If the kernel has EVTCHN_SEND support then it handles timers too,
> +         * so the timer will be restored by kvm_xen_set_vcpu_timer() below.
> +         */
> +        if (env->xen_singleshot_timer_ns) {
> +            ret = do_set_singleshot_timer(cs, env->xen_singleshot_timer_ns,
> +                                    false, false);
> +            if (ret < 0) {
> +                return ret;
> +            }
> +        }
>           return 0;
>       }
>   
> diff --git a/target/i386/machine.c b/target/i386/machine.c
> index 603a1077e3..c7ac8084b2 100644
> --- a/target/i386/machine.c
> +++ b/target/i386/machine.c
> @@ -1277,6 +1277,7 @@ static const VMStateDescription vmstate_xen_vcpu = {
>           VMSTATE_UINT8(env.xen_vcpu_callback_vector, X86CPU),
>           VMSTATE_UINT16_ARRAY(env.xen_virq, X86CPU, XEN_NR_VIRQS),
>           VMSTATE_UINT64(env.xen_singleshot_timer_ns, X86CPU),
> +        VMSTATE_UINT64(env.xen_periodic_timer_period, X86CPU),
>           VMSTATE_END_OF_LIST()
>       }
>   };
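
Incidentally, the reason this compiles without complaint: in C, a value
returned from a function declared bool is implicitly converted to _Bool, so
any nonzero errno collapses to true and the error is indistinguishable from
success. A minimal standalone reduction, with hypothetical names:

#include <errno.h>
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical reduction of the bool-vs-errno mismatch under discussion. */
static bool find_vcpu(int vcpu_id)
{
    if (vcpu_id < 0) {
        return -ENOENT;   /* implicitly converted to true; the error is lost */
    }
    return true;
}

int main(void)
{
    printf("%d\n", find_vcpu(-1));   /* prints 1, not -ENOENT */
    return 0;
}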
David Woodhouse Feb. 20, 2023, 3:49 p.m. UTC | #2
On Mon, 2023-02-20 at 14:29 +0000, Paul Durrant wrote:
> [snip]
> >    static bool kvm_xen_hcall_vcpu_op(struct kvm_xen_exit *exit, X86CPU *cpu,
> >                                      int cmd, int vcpu_id, uint64_t arg)
> >    {
> > -    CPUState *dest = qemu_get_cpu(vcpu_id);
> >        CPUState *cs = CPU(cpu);
> > +    CPUState *dest = cs->cpu_index == vcpu_id ? cs : qemu_get_cpu(vcpu_id);
> >        int err;
> >    
> > +    if (!dest) {
> > +        return -ENOENT;
> > +    }
> > +
> 
> I thought the patch format was catching me out somehow but I don't think 
> so...
> 
> The function declaration says 'static bool kvm_xen_hcall_vcpu_op(...)' 
> but that return value doesn't look very boolean to me. I think you also 
> have the same issue...

Ah, good catch. Thanks! Those additional checks were added later.

But why in $DEITY's name did the compiler not catch that? That almost
makes me reconsider my life choices in having that as the function
API... but this is basically never going to need to change so I think
it's OK. I'll fix it and move on. There are plenty of other choices
I've made in my life which are far more worthy of second-guessing...
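
One way to restore the convention — sketched here with stand-in types rather
than the real QEMU ones, so the actual fix in a later revision may differ —
is to keep the bool meaning "hypercall recognized" and route the errno
through the exit result field, as the other handlers in the series do:

#include <errno.h>
#include <stdbool.h>
#include <stdio.h>

/* Stand-in for the result field of struct kvm_xen_exit. */
struct fake_exit {
    long result;
};

static bool handle_vcpu_op(struct fake_exit *exit, int cmd, int vcpu_id,
                           int cur_vcpu)
{
    int err;

    if (vcpu_id != cur_vcpu) {       /* stand-in for the qemu_get_cpu() check */
        err = -ENOENT;               /* don't 'return -ENOENT' from a bool */
        goto out;
    }

    switch (cmd) {
    case 0:                          /* stand-in for the VCPUOP_* cases */
        err = 0;
        break;
    default:
        return false;                /* genuinely unhandled: caller decides */
    }
 out:
    exit->result = err;              /* errno travels via the exit, not bool */
    return true;
}

int main(void)
{
    struct fake_exit e = { 0 };
    bool handled = handle_vcpu_op(&e, 0, 1, 0);
    printf("handled=%d result=%ld\n", handled, e.result); /* result=-ENOENT */
    return 0;
}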

Patch

diff --git a/hw/i386/kvm/xen_evtchn.c b/hw/i386/kvm/xen_evtchn.c
index 5d5996641d..06572b3e10 100644
--- a/hw/i386/kvm/xen_evtchn.c
+++ b/hw/i386/kvm/xen_evtchn.c
@@ -1220,6 +1220,37 @@  int xen_evtchn_send_op(struct evtchn_send *send)
     return ret;
 }
 
+int xen_evtchn_set_port(uint16_t port)
+{
+    XenEvtchnState *s = xen_evtchn_singleton;
+    XenEvtchnPort *p;
+    int ret = -EINVAL;
+
+    if (!s) {
+        return -ENOTSUP;
+    }
+
+    if (!valid_port(port)) {
+        return -EINVAL;
+    }
+
+    qemu_mutex_lock(&s->port_lock);
+
+    p = &s->port_table[port];
+
+    /* QEMU has no business sending to anything but these */
+    if (p->type == EVTCHNSTAT_virq ||
+        (p->type == EVTCHNSTAT_interdomain &&
+         (p->type_val & PORT_INFO_TYPEVAL_REMOTE_QEMU))) {
+        set_port_pending(s, port);
+        ret = 0;
+    }
+
+    qemu_mutex_unlock(&s->port_lock);
+
+    return ret;
+}
+
 EvtchnInfoList *qmp_xen_event_list(Error **errp)
 {
     XenEvtchnState *s = xen_evtchn_singleton;
diff --git a/hw/i386/kvm/xen_evtchn.h b/hw/i386/kvm/xen_evtchn.h
index b03c3108bc..24611478b8 100644
--- a/hw/i386/kvm/xen_evtchn.h
+++ b/hw/i386/kvm/xen_evtchn.h
@@ -20,6 +20,8 @@  int xen_evtchn_set_callback_param(uint64_t param);
 void xen_evtchn_connect_gsis(qemu_irq *system_gsis);
 void xen_evtchn_set_callback_level(int level);
 
+int xen_evtchn_set_port(uint16_t port);
+
 struct evtchn_status;
 struct evtchn_close;
 struct evtchn_unmask;
diff --git a/target/i386/cpu.h b/target/i386/cpu.h
index e8718c31e5..b579f0f0f8 100644
--- a/target/i386/cpu.h
+++ b/target/i386/cpu.h
@@ -26,6 +26,7 @@ 
 #include "exec/cpu-defs.h"
 #include "qapi/qapi-types-common.h"
 #include "qemu/cpu-float.h"
+#include "qemu/timer.h"
 
 #define XEN_NR_VIRQS 24
 
@@ -1800,6 +1801,10 @@  typedef struct CPUArchState {
     bool xen_callback_asserted;
     uint16_t xen_virq[XEN_NR_VIRQS];
     uint64_t xen_singleshot_timer_ns;
+    QEMUTimer *xen_singleshot_timer;
+    uint64_t xen_periodic_timer_period;
+    QEMUTimer *xen_periodic_timer;
+    QemuMutex xen_timers_lock;
 #endif
 #if defined(CONFIG_HVF)
     HVFX86LazyFlags hvf_lflags;
diff --git a/target/i386/kvm/xen-emu.c b/target/i386/kvm/xen-emu.c
index 44fa0de784..4781b1fa97 100644
--- a/target/i386/kvm/xen-emu.c
+++ b/target/i386/kvm/xen-emu.c
@@ -38,6 +38,9 @@ 
 
 #include "xen-compat.h"
 
+static void xen_vcpu_singleshot_timer_event(void *opaque);
+static void xen_vcpu_periodic_timer_event(void *opaque);
+
 #ifdef TARGET_X86_64
 #define hypercall_compat32(longmode) (!(longmode))
 #else
@@ -201,6 +204,23 @@  int kvm_xen_init_vcpu(CPUState *cs)
     env->xen_vcpu_time_info_gpa = INVALID_GPA;
     env->xen_vcpu_runstate_gpa = INVALID_GPA;
 
+    qemu_mutex_init(&env->xen_timers_lock);
+    env->xen_singleshot_timer = timer_new_ns(QEMU_CLOCK_VIRTUAL,
+                                             xen_vcpu_singleshot_timer_event,
+                                             cpu);
+    if (!env->xen_singleshot_timer) {
+        return -ENOMEM;
+    }
+    env->xen_singleshot_timer->opaque = cs;
+
+    env->xen_periodic_timer = timer_new_ns(QEMU_CLOCK_VIRTUAL,
+                                           xen_vcpu_periodic_timer_event,
+                                           cpu);
+    if (!env->xen_periodic_timer) {
+        return -ENOMEM;
+    }
+    env->xen_periodic_timer->opaque = cs;
+
     return 0;
 }
 
@@ -232,7 +252,8 @@  static bool kvm_xen_hcall_xen_version(struct kvm_xen_exit *exit, X86CPU *cpu,
                          1 << XENFEAT_writable_descriptor_tables |
                          1 << XENFEAT_auto_translated_physmap |
                          1 << XENFEAT_supervisor_mode_kernel |
-                         1 << XENFEAT_hvm_callback_vector;
+                         1 << XENFEAT_hvm_callback_vector |
+                         1 << XENFEAT_hvm_safe_pvclock;
         }
 
         err = kvm_copy_to_gva(CPU(cpu), arg, &fi, sizeof(fi));
@@ -875,13 +896,192 @@  static int vcpuop_register_runstate_info(CPUState *cs, CPUState *target,
     return 0;
 }
 
+static uint64_t kvm_get_current_ns(void)
+{
+    struct kvm_clock_data data;
+    int ret;
+
+    ret = kvm_vm_ioctl(kvm_state, KVM_GET_CLOCK, &data);
+    if (ret < 0) {
+        fprintf(stderr, "KVM_GET_CLOCK failed: %s\n", strerror(-ret));
+        abort();
+    }
+
+    return data.clock;
+}
+
+static void xen_vcpu_singleshot_timer_event(void *opaque)
+{
+    CPUState *cpu = opaque;
+    CPUX86State *env = &X86_CPU(cpu)->env;
+    uint16_t port = env->xen_virq[VIRQ_TIMER];
+
+    if (likely(port)) {
+        xen_evtchn_set_port(port);
+    }
+
+    qemu_mutex_lock(&env->xen_timers_lock);
+    env->xen_singleshot_timer_ns = 0;
+    qemu_mutex_unlock(&env->xen_timers_lock);
+}
+
+static void xen_vcpu_periodic_timer_event(void *opaque)
+{
+    CPUState *cpu = opaque;
+    CPUX86State *env = &X86_CPU(cpu)->env;
+    uint16_t port = env->xen_virq[VIRQ_TIMER];
+    int64_t qemu_now;
+
+    if (likely(port)) {
+        xen_evtchn_set_port(port);
+    }
+
+    qemu_mutex_lock(&env->xen_timers_lock);
+
+    qemu_now = qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL);
+    timer_mod_ns(env->xen_periodic_timer,
+                 qemu_now + env->xen_periodic_timer_period);
+
+    qemu_mutex_unlock(&env->xen_timers_lock);
+}
+
+static int do_set_periodic_timer(CPUState *target, uint64_t period_ns)
+{
+    CPUX86State *tenv = &X86_CPU(target)->env;
+    int64_t qemu_now;
+
+    timer_del(tenv->xen_periodic_timer);
+
+    qemu_mutex_lock(&tenv->xen_timers_lock);
+
+    qemu_now = qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL);
+    timer_mod_ns(tenv->xen_periodic_timer, qemu_now + period_ns);
+    tenv->xen_periodic_timer_period = period_ns;
+
+    qemu_mutex_unlock(&tenv->xen_timers_lock);
+    return 0;
+}
+
+#define MILLISECS(_ms)  ((int64_t)((_ms) * 1000000ULL))
+#define MICROSECS(_us)  ((int64_t)((_us) * 1000ULL))
+#define STIME_MAX ((time_t)((int64_t)~0ull >> 1))
+/* Chosen so (NOW() + delta) won't overflow without an uptime of 200 years */
+#define STIME_DELTA_MAX ((int64_t)((uint64_t)~0ull >> 2))
+
+static int vcpuop_set_periodic_timer(CPUState *cs, CPUState *target,
+                                     uint64_t arg)
+{
+    struct vcpu_set_periodic_timer spt;
+
+    qemu_build_assert(sizeof(spt) == 8);
+    if (kvm_copy_from_gva(cs, arg, &spt, sizeof(spt))) {
+        return -EFAULT;
+    }
+
+    if (spt.period_ns < MILLISECS(1) || spt.period_ns > STIME_DELTA_MAX) {
+        return -EINVAL;
+    }
+
+    return do_set_periodic_timer(target, spt.period_ns);
+}
+
+static int vcpuop_stop_periodic_timer(CPUState *target)
+{
+    CPUX86State *tenv = &X86_CPU(target)->env;
+
+    qemu_mutex_lock(&tenv->xen_timers_lock);
+
+    timer_del(tenv->xen_periodic_timer);
+    tenv->xen_periodic_timer_period = 0;
+
+    qemu_mutex_unlock(&tenv->xen_timers_lock);
+    return 0;
+}
+
+static int do_set_singleshot_timer(CPUState *cs, uint64_t timeout_abs,
+                                   bool future, bool linux_wa)
+{
+    CPUX86State *env = &X86_CPU(cs)->env;
+    int64_t now = kvm_get_current_ns();
+    int64_t qemu_now = qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL);
+    int64_t delta = timeout_abs - now;
+
+    if (future && timeout_abs < now) {
+        return -ETIME;
+    }
+
+    if (linux_wa && unlikely((int64_t)timeout_abs < 0 ||
+                             (delta > 0 && (uint32_t)(delta >> 50) != 0))) {
+        /*
+         * Xen has a 'Linux workaround' in do_set_timer_op() which checks
+         * for negative absolute timeout values (caused by integer
+         * overflow), and for values about 13 days in the future (2^50ns)
+         * which would be caused by jiffies overflow. For those cases, it
+         * sets the timeout 100ms in the future (not *too* soon, since if
+         * a guest really did set a long timeout on purpose we don't want
+         * to keep churning CPU time by waking it up).
+         */
+        delta = (100 * SCALE_MS);
+        timeout_abs = now + delta;
+    }
+
+    qemu_mutex_lock(&env->xen_timers_lock);
+
+    timer_mod_ns(env->xen_singleshot_timer, qemu_now + delta);
+    env->xen_singleshot_timer_ns = now + delta;
+
+    qemu_mutex_unlock(&env->xen_timers_lock);
+    return 0;
+}
+
+static int vcpuop_set_singleshot_timer(CPUState *cs, uint64_t arg)
+{
+    struct vcpu_set_singleshot_timer sst;
+
+    qemu_build_assert(sizeof(sst) == 16);
+    if (kvm_copy_from_gva(cs, arg, &sst, sizeof(sst))) {
+        return -EFAULT;
+    }
+
+    return do_set_singleshot_timer(cs, sst.timeout_abs_ns,
+                                   !!(sst.flags & VCPU_SSHOTTMR_future),
+                                   false);
+}
+
+static int vcpuop_stop_singleshot_timer(CPUState *cs)
+{
+    CPUX86State *env = &X86_CPU(cs)->env;
+
+    qemu_mutex_lock(&env->xen_timers_lock);
+
+    timer_del(env->xen_singleshot_timer);
+    env->xen_singleshot_timer_ns = 0;
+
+    qemu_mutex_unlock(&env->xen_timers_lock);
+    return 0;
+}
+
+static int kvm_xen_hcall_set_timer_op(struct kvm_xen_exit *exit, X86CPU *cpu,
+                                      uint64_t timeout)
+{
+    if (unlikely(timeout == 0)) {
+        return vcpuop_stop_singleshot_timer(CPU(cpu));
+    } else {
+        return do_set_singleshot_timer(CPU(cpu), timeout, false, true);
+    }
+}
+
 static bool kvm_xen_hcall_vcpu_op(struct kvm_xen_exit *exit, X86CPU *cpu,
                                   int cmd, int vcpu_id, uint64_t arg)
 {
-    CPUState *dest = qemu_get_cpu(vcpu_id);
     CPUState *cs = CPU(cpu);
+    CPUState *dest = cs->cpu_index == vcpu_id ? cs : qemu_get_cpu(vcpu_id);
     int err;
 
+    if (!dest) {
+        return -ENOENT;
+    }
+
     switch (cmd) {
     case VCPUOP_register_runstate_memory_area:
         err = vcpuop_register_runstate_info(cs, dest, arg);
@@ -892,6 +1092,26 @@  static bool kvm_xen_hcall_vcpu_op(struct kvm_xen_exit *exit, X86CPU *cpu,
     case VCPUOP_register_vcpu_info:
         err = vcpuop_register_vcpu_info(cs, dest, arg);
         break;
+    case VCPUOP_set_singleshot_timer: {
+        if (cs->cpu_index != vcpu_id) {
+            return -EINVAL;
+        }
+        err = vcpuop_set_singleshot_timer(dest, arg);
+        break;
+    }
+    case VCPUOP_stop_singleshot_timer:
+        if (cs->cpu_index != vcpu_id) {
+            return -EINVAL;
+        }
+        err = vcpuop_stop_singleshot_timer(dest);
+        break;
+    case VCPUOP_set_periodic_timer: {
+        err = vcpuop_set_periodic_timer(cs, dest, arg);
+        break;
+    }
+    case VCPUOP_stop_periodic_timer:
+        err = vcpuop_stop_periodic_timer(dest);
+        break;
 
     default:
         return false;
@@ -1246,6 +1466,16 @@  static bool do_kvm_xen_handle_exit(X86CPU *cpu, struct kvm_xen_exit *exit)
     }
 
     switch (code) {
+    case __HYPERVISOR_set_timer_op:
+        if (exit->u.hcall.longmode) {
+            return kvm_xen_hcall_set_timer_op(exit, cpu,
+                                              exit->u.hcall.params[0]);
+        } else {
+            /* In 32-bit mode, the 64-bit timer value is in two args. */
+            uint64_t val = ((uint64_t)exit->u.hcall.params[1]) << 32 |
+                (uint32_t)exit->u.hcall.params[0];
+            return kvm_xen_hcall_set_timer_op(exit, cpu, val);
+        }
     case __HYPERVISOR_grant_table_op:
         return kvm_xen_hcall_gnttab_op(exit, cpu, exit->u.hcall.params[0],
                                        exit->u.hcall.params[1],
@@ -1355,7 +1585,25 @@  int kvm_put_xen_state(CPUState *cs)
         }
     }
 
+    if (env->xen_periodic_timer_period) {
+        ret = do_set_periodic_timer(cs, env->xen_periodic_timer_period);
+        if (ret < 0) {
+            return ret;
+        }
+    }
+
     if (!kvm_xen_has_cap(EVTCHN_SEND)) {
+        /*
+         * If the kernel has EVTCHN_SEND support then it handles timers too,
+         * so the timer will be restored by kvm_xen_set_vcpu_timer() below.
+         */
+        if (env->xen_singleshot_timer_ns) {
+            ret = do_set_singleshot_timer(cs, env->xen_singleshot_timer_ns,
+                                    false, false);
+            if (ret < 0) {
+                return ret;
+            }
+        }
         return 0;
     }
 
diff --git a/target/i386/machine.c b/target/i386/machine.c
index 603a1077e3..c7ac8084b2 100644
--- a/target/i386/machine.c
+++ b/target/i386/machine.c
@@ -1277,6 +1277,7 @@  static const VMStateDescription vmstate_xen_vcpu = {
         VMSTATE_UINT8(env.xen_vcpu_callback_vector, X86CPU),
         VMSTATE_UINT16_ARRAY(env.xen_virq, X86CPU, XEN_NR_VIRQS),
         VMSTATE_UINT64(env.xen_singleshot_timer_ns, X86CPU),
+        VMSTATE_UINT64(env.xen_periodic_timer_period, X86CPU),
         VMSTATE_END_OF_LIST()
     }
 };
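
As a footnote on the 32-bit compat path in do_kvm_xen_handle_exit() above:
the 64-bit deadline arrives split across two hypercall argument slots, and
the reconstruction can be sanity-checked standalone (the sample value below
is arbitrary):

#include <assert.h>
#include <stdint.h>
#include <stdio.h>

int main(void)
{
    /* A 32-bit guest passes the deadline's low and high halves in two
     * hypercall parameters; arbitrary sample value for illustration. */
    uint64_t timeout = 0x123456789abcdef0ULL;
    uint64_t params[2] = { (uint32_t)timeout, (uint32_t)(timeout >> 32) };

    /* Same expression as the patch uses to reassemble the value. */
    uint64_t val = ((uint64_t)params[1]) << 32 | (uint32_t)params[0];

    assert(val == timeout);
    printf("0x%016llx\n", (unsigned long long)val);
    return 0;
}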