Patch Detail
get:
Show a patch.
patch:
Update a patch (partial update).
put:
Update a patch (full update).
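As a rough illustration of the update methods listed above, the sketch below builds (but does not send) a PATCH request for a single patch. The host and patch ID are taken from the example request in this document. The helper name `build_update_request`, the placeholder token, the `Token` authorization scheme, and the assumption that `archived` is writable for the authenticated user are illustrative assumptions, not confirmed by this page.

```python
import json
import urllib.request

# Host copied from the example response in this document.
API_ROOT = "http://patchwork.ozlabs.org/api"


def build_update_request(patch_id: int, fields: dict, token: str) -> urllib.request.Request:
    """Build a PATCH request carrying only the fields to change.

    Hypothetical helper: the "Token" auth scheme and the set of
    writable fields are assumptions, not taken from this page.
    """
    body = json.dumps(fields).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_ROOT}/patches/{patch_id}/",
        data=body,
        method="PATCH",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Token {token}",
        },
    )


# Example: request that patch 812238 be marked as archived.
# The request is only constructed here; passing it to
# urllib.request.urlopen() would actually send it.
req = build_update_request(812238, {"archived": True}, token="PLACEHOLDER")
```

Using PATCH rather than PUT keeps the body limited to the fields being changed, which fits the small set of fields a maintainer typically edits.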
GET /api/patches/812238/?format=api
{ "id": 812238, "url": "http://patchwork.ozlabs.org/api/patches/812238/?format=api", "web_url": "http://patchwork.ozlabs.org/project/qemu-devel/patch/1505202976-1784-5-git-send-email-changpeng.liu@intel.com/", "project": { "id": 14, "url": "http://patchwork.ozlabs.org/api/projects/14/?format=api", "name": "QEMU Development", "link_name": "qemu-devel", "list_id": "qemu-devel.nongnu.org", "list_email": "qemu-devel@nongnu.org", "web_url": "", "scm_url": "", "webscm_url": "", "list_archive_url": "", "list_archive_url_format": "", "commit_url_format": "" }, "msgid": "<1505202976-1784-5-git-send-email-changpeng.liu@intel.com>", "list_archive_url": null, "date": "2017-09-12T07:56:16", "name": "[v3,4/4] contrib/vhost-user-blk: introduce a vhost-user-blk sample application", "commit_ref": null, "pull_url": null, "state": "new", "archived": false, "hash": "33a9bb892b7fdbcdf1bfeea1400b8a56d3035d58", "submitter": { "id": 71275, "url": "http://patchwork.ozlabs.org/api/people/71275/?format=api", "name": "Liu, Changpeng", "email": "changpeng.liu@intel.com" }, "delegate": null, "mbox": "http://patchwork.ozlabs.org/project/qemu-devel/patch/1505202976-1784-5-git-send-email-changpeng.liu@intel.com/mbox/", "series": [ { "id": 2447, "url": "http://patchwork.ozlabs.org/api/series/2447/?format=api", "web_url": "http://patchwork.ozlabs.org/project/qemu-devel/list/?series=2447", "date": "2017-09-12T07:56:15", "name": "*** Introduce a new vhost-user-blk host device to Qemu ***", "version": 3, "mbox": "http://patchwork.ozlabs.org/series/2447/mbox/" } ], "comments": "http://patchwork.ozlabs.org/api/patches/812238/comments/", "check": "pending", "checks": "http://patchwork.ozlabs.org/api/patches/812238/checks/", "tags": {}, "related": [], "headers": { "Return-Path": "<qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org>", "X-Original-To": "incoming@patchwork.ozlabs.org", "Delivered-To": "patchwork-incoming@bilbo.ozlabs.org", "Authentication-Results": "ozlabs.org;\n\tspf=pass (mailfrom) 
smtp.mailfrom=nongnu.org\n\t(client-ip=2001:4830:134:3::11; helo=lists.gnu.org;\n\tenvelope-from=qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org;\n\treceiver=<UNKNOWN>)", "Received": [ "from lists.gnu.org (lists.gnu.org [IPv6:2001:4830:134:3::11])\n\t(using TLSv1 with cipher AES256-SHA (256/256 bits))\n\t(No client certificate requested)\n\tby ozlabs.org (Postfix) with ESMTPS id 3xrKZ41jDNz9s83\n\tfor <incoming@patchwork.ozlabs.org>;\n\tMon, 11 Sep 2017 17:39:24 +1000 (AEST)", "from localhost ([::1]:55955 helo=lists.gnu.org)\n\tby lists.gnu.org with esmtp (Exim 4.71) (envelope-from\n\t<qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org>)\n\tid 1drJJS-0003E0-CU\n\tfor incoming@patchwork.ozlabs.org; Mon, 11 Sep 2017 03:39:22 -0400", "from eggs.gnu.org ([2001:4830:134:3::10]:43401)\n\tby lists.gnu.org with esmtp (Exim 4.71)\n\t(envelope-from <changpeng.liu@intel.com>) id 1drJGX-0001Di-3Z\n\tfor qemu-devel@nongnu.org; Mon, 11 Sep 2017 03:36:23 -0400", "from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)\n\t(envelope-from <changpeng.liu@intel.com>) id 1drJGT-0007ea-Np\n\tfor qemu-devel@nongnu.org; Mon, 11 Sep 2017 03:36:21 -0400", "from mga05.intel.com ([192.55.52.43]:18824)\n\tby eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32)\n\t(Exim 4.71) (envelope-from <changpeng.liu@intel.com>)\n\tid 1drJGT-0007cA-EU\n\tfor qemu-devel@nongnu.org; Mon, 11 Sep 2017 03:36:17 -0400", "from orsmga004.jf.intel.com ([10.7.209.38])\n\tby fmsmga105.fm.intel.com with ESMTP; 11 Sep 2017 00:36:14 -0700", "from fedora.sh.intel.com ([10.67.112.210])\n\tby orsmga004.jf.intel.com with ESMTP; 11 Sep 2017 00:36:11 -0700" ], "X-ExtLoop1": "1", "X-IronPort-AV": "E=Sophos;i=\"5.42,376,1500966000\"; d=\"scan'208\";a=\"127495102\"", "From": "Changpeng Liu <changpeng.liu@intel.com>", "To": "changpeng.liu@intel.com,\n\tqemu-devel@nongnu.org", "Date": "Tue, 12 Sep 2017 15:56:16 +0800", "Message-Id": "<1505202976-1784-5-git-send-email-changpeng.liu@intel.com>", 
"X-Mailer": "git-send-email 1.9.3", "In-Reply-To": "<1505202976-1784-1-git-send-email-changpeng.liu@intel.com>", "References": "<1505202976-1784-1-git-send-email-changpeng.liu@intel.com>", "X-detected-operating-system": "by eggs.gnu.org: Genre and OS details not\n\trecognized.", "X-Received-From": "192.55.52.43", "Subject": "[Qemu-devel] [PATCH v3 4/4] contrib/vhost-user-blk: introduce a\n\tvhost-user-blk sample application", "X-BeenThere": "qemu-devel@nongnu.org", "X-Mailman-Version": "2.1.21", "Precedence": "list", "List-Id": "<qemu-devel.nongnu.org>", "List-Unsubscribe": "<https://lists.nongnu.org/mailman/options/qemu-devel>,\n\t<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>", "List-Archive": "<http://lists.nongnu.org/archive/html/qemu-devel/>", "List-Post": "<mailto:qemu-devel@nongnu.org>", "List-Help": "<mailto:qemu-devel-request@nongnu.org?subject=help>", "List-Subscribe": "<https://lists.nongnu.org/mailman/listinfo/qemu-devel>,\n\t<mailto:qemu-devel-request@nongnu.org?subject=subscribe>", "Cc": "james.r.harris@intel.com, mst@redhat.com, stefanha@gmail.com,\n\tpbonzini@redhat.com, felipe@nutanix.com, marcandre.lureau@redhat.com", "Errors-To": "qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org", "Sender": "\"Qemu-devel\"\n\t<qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org>" }, "content": "This commit introcudes a vhost-user-blk backend device, it uses UNIX\ndomain socket to communicate with Qemu. 
The vhost-user-blk sample\napplication should be used with Qemu vhost-user-blk-pci device.\n\nTo use it, complie with:\nmake vhost-user-blk\n\nand start like this:\nvhost-user-blk -b /dev/sdb -s /path/vhost.socket\n\nSigned-off-by: Changpeng Liu <changpeng.liu@intel.com>\n---\n .gitignore | 1 +\n Makefile | 3 +\n Makefile.objs | 2 +\n contrib/vhost-user-blk/Makefile.objs | 1 +\n contrib/vhost-user-blk/vhost-user-blk.c | 735 ++++++++++++++++++++++++++++++++\n 5 files changed, 742 insertions(+)\n create mode 100644 contrib/vhost-user-blk/Makefile.objs\n create mode 100644 contrib/vhost-user-blk/vhost-user-blk.c", "diff": "diff --git a/.gitignore b/.gitignore\nindex cf65316..dbe5c13 100644\n--- a/.gitignore\n+++ b/.gitignore\n@@ -51,6 +51,7 @@\n /module_block.h\n /vscclient\n /vhost-user-scsi\n+/vhost-user-blk\n /fsdev/virtfs-proxy-helper\n *.[1-9]\n *.a\ndiff --git a/Makefile b/Makefile\nindex 337a1f6..91782b0 100644\n--- a/Makefile\n+++ b/Makefile\n@@ -270,6 +270,7 @@ dummy := $(call unnest-vars,, \\\n ivshmem-server-obj-y \\\n libvhost-user-obj-y \\\n vhost-user-scsi-obj-y \\\n+ vhost-user-blk-obj-y \\\n qga-vss-dll-obj-y \\\n block-obj-y \\\n block-obj-m \\\n@@ -485,6 +486,8 @@ ivshmem-server$(EXESUF): $(ivshmem-server-obj-y) $(COMMON_LDADDS)\n endif\n vhost-user-scsi$(EXESUF): $(vhost-user-scsi-obj-y)\n \t$(call LINK, $^)\n+vhost-user-blk$(EXESUF): $(vhost-user-blk-obj-y)\n+\t$(call LINK, $^)\n \n module_block.h: $(SRC_PATH)/scripts/modules/module_block.py config-host.mak\n \t$(call quiet-command,$(PYTHON) $< $@ \\\ndiff --git a/Makefile.objs b/Makefile.objs\nindex 24a4ea0..6b81548 100644\n--- a/Makefile.objs\n+++ b/Makefile.objs\n@@ -114,6 +114,8 @@ vhost-user-scsi.o-cflags := $(LIBISCSI_CFLAGS)\n vhost-user-scsi.o-libs := $(LIBISCSI_LIBS)\n vhost-user-scsi-obj-y = contrib/vhost-user-scsi/\n vhost-user-scsi-obj-y += contrib/libvhost-user/libvhost-user.o\n+vhost-user-blk-obj-y = contrib/vhost-user-blk/\n+vhost-user-blk-obj-y += 
contrib/libvhost-user/libvhost-user.o\n \n ######################################################################\n trace-events-subdirs =\ndiff --git a/contrib/vhost-user-blk/Makefile.objs b/contrib/vhost-user-blk/Makefile.objs\nnew file mode 100644\nindex 0000000..72e2cdc\n--- /dev/null\n+++ b/contrib/vhost-user-blk/Makefile.objs\n@@ -0,0 +1 @@\n+vhost-user-blk-obj-y = vhost-user-blk.o\ndiff --git a/contrib/vhost-user-blk/vhost-user-blk.c b/contrib/vhost-user-blk/vhost-user-blk.c\nnew file mode 100644\nindex 0000000..9b90164\n--- /dev/null\n+++ b/contrib/vhost-user-blk/vhost-user-blk.c\n@@ -0,0 +1,735 @@\n+/*\n+ * vhost-user-blk sample application\n+ *\n+ * Copyright IBM, Corp. 2007\n+ * Copyright (c) 2016 Nutanix Inc. All rights reserved.\n+ * Copyright (c) 2017 Intel Corporation. All rights reserved.\n+ *\n+ * Author:\n+ * Anthony Liguori <aliguori@us.ibm.com>\n+ * Felipe Franciosi <felipe@nutanix.com>\n+ * Changpeng Liu <changpeng.liu@intel.com>\n+ *\n+ * This work is licensed under the terms of the GNU GPL, version 2 only.\n+ * See the COPYING file in the top-level directory.\n+ */\n+\n+#include \"qemu/osdep.h\"\n+#include \"hw/virtio/virtio-blk.h\"\n+#include \"contrib/libvhost-user/libvhost-user.h\"\n+\n+#include <glib.h>\n+\n+/* Small compat shim from glib 2.32 */\n+#ifndef G_SOURCE_CONTINUE\n+#define G_SOURCE_CONTINUE TRUE\n+#endif\n+#ifndef G_SOURCE_REMOVE\n+#define G_SOURCE_REMOVE FALSE\n+#endif\n+\n+/* And this is the final byte of request*/\n+#define VIRTIO_BLK_S_OK 0\n+#define VIRTIO_BLK_S_IOERR 1\n+#define VIRTIO_BLK_S_UNSUPP 2\n+\n+typedef struct vhost_blk_dev {\n+ VuDev vu_dev;\n+ int server_sock;\n+ int blk_fd;\n+ struct virtio_blk_config blkcfg;\n+ char *blk_name;\n+ GMainLoop *loop;\n+ GTree *fdmap; /* fd -> gsource context id */\n+} vhost_blk_dev_t;\n+\n+typedef struct vhost_blk_request {\n+ VuVirtqElement *elem;\n+ int64_t sector_num;\n+ size_t size;\n+ struct virtio_blk_inhdr *in;\n+ struct virtio_blk_outhdr *out;\n+ vhost_blk_dev_t 
*vdev_blk;\n+ struct VuVirtq *vq;\n+} vhost_blk_request_t;\n+\n+/** refer util/iov.c **/\n+static size_t vu_blk_iov_size(const struct iovec *iov,\n+ const unsigned int iov_cnt)\n+{\n+ size_t len;\n+ unsigned int i;\n+\n+ len = 0;\n+ for (i = 0; i < iov_cnt; i++) {\n+ len += iov[i].iov_len;\n+ }\n+ return len;\n+}\n+\n+/** glib event loop integration for libvhost-user and misc callbacks **/\n+\n+QEMU_BUILD_BUG_ON((int)G_IO_IN != (int)VU_WATCH_IN);\n+QEMU_BUILD_BUG_ON((int)G_IO_OUT != (int)VU_WATCH_OUT);\n+QEMU_BUILD_BUG_ON((int)G_IO_PRI != (int)VU_WATCH_PRI);\n+QEMU_BUILD_BUG_ON((int)G_IO_ERR != (int)VU_WATCH_ERR);\n+QEMU_BUILD_BUG_ON((int)G_IO_HUP != (int)VU_WATCH_HUP);\n+\n+typedef struct vu_blk_gsrc {\n+ GSource parent;\n+ vhost_blk_dev_t *vdev_blk;\n+ GPollFD gfd;\n+ vu_watch_cb vu_cb;\n+} vu_blk_gsrc_t;\n+\n+static gint vu_blk_fdmap_compare(gconstpointer a, gconstpointer b)\n+{\n+ return (b > a) - (b < a);\n+}\n+\n+static gboolean vu_blk_gsrc_prepare(GSource *src, gint *timeout)\n+{\n+ assert(timeout);\n+\n+ *timeout = -1;\n+ return FALSE;\n+}\n+\n+static gboolean vu_blk_gsrc_check(GSource *src)\n+{\n+ vu_blk_gsrc_t *vu_blk_src = (vu_blk_gsrc_t *)src;\n+\n+ assert(vu_blk_src);\n+\n+ return vu_blk_src->gfd.revents & vu_blk_src->gfd.events;\n+}\n+\n+static gboolean vu_blk_gsrc_dispatch(GSource *src,\n+ GSourceFunc cb, gpointer data)\n+{\n+ vhost_blk_dev_t *vdev_blk;\n+ vu_blk_gsrc_t *vu_blk_src = (vu_blk_gsrc_t *)src;\n+\n+ assert(vu_blk_src);\n+ assert(!(vu_blk_src->vu_cb && cb));\n+\n+ vdev_blk = vu_blk_src->vdev_blk;\n+\n+ assert(vdev_blk);\n+\n+ if (cb) {\n+ return cb(data);\n+ }\n+ if (vu_blk_src->vu_cb) {\n+ vu_blk_src->vu_cb(&vdev_blk->vu_dev, vu_blk_src->gfd.revents, data);\n+ }\n+ return G_SOURCE_CONTINUE;\n+}\n+\n+static GSourceFuncs vu_blk_gsrc_funcs = {\n+ vu_blk_gsrc_prepare,\n+ vu_blk_gsrc_check,\n+ vu_blk_gsrc_dispatch,\n+ NULL\n+};\n+\n+static int vu_blk_gsrc_new(vhost_blk_dev_t *vdev_blk, int fd,\n+ GIOCondition cond, vu_watch_cb vu_cb,\n+ 
GSourceFunc gsrc_cb, gpointer data)\n+{\n+ GSource *vu_blk_gsrc;\n+ vu_blk_gsrc_t *vu_blk_src;\n+ guint id;\n+\n+ assert(vdev_blk);\n+ assert(fd >= 0);\n+ assert(vu_cb || gsrc_cb);\n+ assert(!(vu_cb && gsrc_cb));\n+\n+ vu_blk_gsrc = g_source_new(&vu_blk_gsrc_funcs, sizeof(vu_blk_gsrc_t));\n+ if (!vu_blk_gsrc) {\n+ fprintf(stderr, \"Error creating GSource for new watch\\n\");\n+ return -1;\n+ }\n+ vu_blk_src = (vu_blk_gsrc_t *)vu_blk_gsrc;\n+\n+ vu_blk_src->vdev_blk = vdev_blk;\n+ vu_blk_src->gfd.fd = fd;\n+ vu_blk_src->gfd.events = cond;\n+ vu_blk_src->vu_cb = vu_cb;\n+\n+ g_source_add_poll(vu_blk_gsrc, &vu_blk_src->gfd);\n+ g_source_set_callback(vu_blk_gsrc, gsrc_cb, data, NULL);\n+ id = g_source_attach(vu_blk_gsrc, NULL);\n+ assert(id);\n+ g_source_unref(vu_blk_gsrc);\n+\n+ g_tree_insert(vdev_blk->fdmap, (gpointer)(uintptr_t)fd,\n+ (gpointer)(uintptr_t)id);\n+\n+ return 0;\n+}\n+\n+static void vu_blk_panic_cb(VuDev *vu_dev, const char *buf)\n+{\n+ vhost_blk_dev_t *vdev_blk;\n+\n+ assert(vu_dev);\n+\n+ vdev_blk = container_of(vu_dev, vhost_blk_dev_t, vu_dev);\n+\n+ if (buf) {\n+ fprintf(stderr, \"vu_blk_panic_cb: %s\\n\", buf);\n+ }\n+\n+ if (vdev_blk) {\n+ assert(vdev_blk->loop);\n+ g_main_loop_quit(vdev_blk->loop);\n+ }\n+}\n+\n+static void vu_blk_add_watch_cb(VuDev *vu_dev, int fd, int vu_evt,\n+ vu_watch_cb cb, void *pvt) {\n+ vhost_blk_dev_t *vdev_blk;\n+ guint id;\n+\n+ assert(vu_dev);\n+ assert(fd >= 0);\n+ assert(cb);\n+\n+ vdev_blk = container_of(vu_dev, vhost_blk_dev_t, vu_dev);\n+ if (!vdev_blk) {\n+ vu_blk_panic_cb(vu_dev, NULL);\n+ return;\n+ }\n+\n+ id = (guint)(uintptr_t)g_tree_lookup(vdev_blk->fdmap,\n+ (gpointer)(uintptr_t)fd);\n+ if (id) {\n+ GSource *vu_blk_src = g_main_context_find_source_by_id(NULL, id);\n+ assert(vu_blk_src);\n+ g_source_destroy(vu_blk_src);\n+ (void)g_tree_remove(vdev_blk->fdmap, (gpointer)(uintptr_t)fd);\n+ }\n+\n+ if (vu_blk_gsrc_new(vdev_blk, fd, vu_evt, cb, NULL, pvt)) {\n+ vu_blk_panic_cb(vu_dev, NULL);\n+ 
}\n+}\n+\n+static void vu_blk_del_watch_cb(VuDev *vu_dev, int fd)\n+{\n+ vhost_blk_dev_t *vdev_blk;\n+ guint id;\n+\n+ assert(vu_dev);\n+ assert(fd >= 0);\n+\n+ vdev_blk = container_of(vu_dev, vhost_blk_dev_t, vu_dev);\n+ if (!vdev_blk) {\n+ vu_blk_panic_cb(vu_dev, NULL);\n+ return;\n+ }\n+\n+ id = (guint)(uintptr_t)g_tree_lookup(vdev_blk->fdmap,\n+ (gpointer)(uintptr_t)fd);\n+ if (id) {\n+ GSource *vu_blk_src = g_main_context_find_source_by_id(NULL, id);\n+ assert(vu_blk_src);\n+ g_source_destroy(vu_blk_src);\n+ (void)g_tree_remove(vdev_blk->fdmap, (gpointer)(uintptr_t)fd);\n+ }\n+}\n+\n+static void vu_blk_req_complete(vhost_blk_request_t *req)\n+{\n+ VuDev *vu_dev = &req->vdev_blk->vu_dev;\n+\n+ /* IO size with 1 extra status byte */\n+ vu_queue_push(vu_dev, req->vq, req->elem,\n+ req->size + 1);\n+ vu_queue_notify(vu_dev, req->vq);\n+\n+ if (req->elem) {\n+ free(req->elem);\n+ }\n+ if (req) {\n+ free(req);\n+ }\n+}\n+\n+static int vu_blk_open(const char *file_name)\n+{\n+ int fd;\n+\n+ fd = open(file_name, O_RDWR | O_DIRECT);\n+ if (fd < 0) {\n+ fprintf(stderr, \"Cannot open file %s, %s\\n\", file_name,\n+ strerror(errno));\n+ return -1;\n+ }\n+\n+ return fd;\n+}\n+\n+static void vu_blk_close(int fd)\n+{\n+ if (fd >= 0) {\n+ close(fd);\n+ }\n+}\n+\n+static ssize_t\n+vu_blk_readv(vhost_blk_request_t *req, struct iovec *iov, uint32_t iovcnt)\n+{\n+ vhost_blk_dev_t *vdev_blk = req->vdev_blk;\n+ ssize_t rc;\n+\n+ if (!iovcnt) {\n+ fprintf(stderr, \"Invalid Read IOV count\\n\");\n+ return -1;\n+ }\n+\n+ req->size = vu_blk_iov_size(iov, iovcnt);\n+ rc = preadv(vdev_blk->blk_fd, iov, iovcnt, req->sector_num * 512);\n+ if (rc < 0) {\n+ fprintf(stderr, \"Block %s, Sector %\"PRIu64\", Size %lu Read Failed\\n\",\n+ vdev_blk->blk_name, req->sector_num, req->size);\n+ return -1;\n+ }\n+\n+ return rc;\n+}\n+\n+static ssize_t\n+vu_blk_writev(vhost_blk_request_t *req, struct iovec *iov, uint32_t iovcnt)\n+{\n+ vhost_blk_dev_t *vdev_blk = req->vdev_blk;\n+ ssize_t rc;\n+\n+ if 
(!iovcnt) {\n+ fprintf(stderr, \"Invalid Write IOV count\\n\");\n+ return -1;\n+ }\n+\n+ req->size = vu_blk_iov_size(iov, iovcnt);\n+ rc = pwritev(vdev_blk->blk_fd, iov, iovcnt, req->sector_num * 512);\n+ if (rc < 0) {\n+ fprintf(stderr, \"Block %s, Sector %\"PRIu64\", Size %lu Write Failed\\n\",\n+ vdev_blk->blk_name, req->sector_num, req->size);\n+ return -1;\n+ }\n+\n+ return rc;\n+}\n+\n+static void\n+vu_blk_flush(vhost_blk_request_t *req)\n+{\n+ vhost_blk_dev_t *vdev_blk = req->vdev_blk;\n+\n+ if (vdev_blk->blk_fd) {\n+ fsync(vdev_blk->blk_fd);\n+ }\n+}\n+\n+\n+static int vu_virtio_blk_process_req(vhost_blk_dev_t *vdev_blk,\n+ VuVirtq *vq)\n+{\n+ VuVirtqElement *elem;\n+ uint32_t type;\n+ unsigned in_num;\n+ unsigned out_num;\n+ vhost_blk_request_t *req;\n+\n+ elem = vu_queue_pop(&vdev_blk->vu_dev, vq, sizeof(VuVirtqElement));\n+ if (!elem) {\n+ return -1;\n+ }\n+\n+ /* refer to hw/block/virtio_blk.c */\n+ if (elem->out_num < 1 || elem->in_num < 1) {\n+ fprintf(stderr, \"virtio-blk request missing headers\\n\");\n+ free(elem);\n+ return -1;\n+ }\n+\n+ req = calloc(1, sizeof(*req));\n+ assert(req);\n+ req->vdev_blk = vdev_blk;\n+ req->vq = vq;\n+ req->elem = elem;\n+\n+ in_num = elem->in_num;\n+ out_num = elem->out_num;\n+\n+ /* don't support VIRTIO_F_ANY_LAYOUT and virtio 1.0 only */\n+ if (elem->out_sg[0].iov_len < sizeof(struct virtio_blk_outhdr)) {\n+ fprintf(stderr, \"Invalid outhdr size\\n\");\n+ goto err;\n+ }\n+ req->out = (struct virtio_blk_outhdr *)elem->out_sg[0].iov_base;\n+ out_num--;\n+\n+ if (elem->in_sg[in_num - 1].iov_len < sizeof(struct virtio_blk_inhdr)) {\n+ fprintf(stderr, \"Invalid inhdr size\\n\");\n+ goto err;\n+ }\n+ req->in = (struct virtio_blk_inhdr *)elem->in_sg[in_num - 1].iov_base;\n+ in_num--;\n+\n+ type = le32_to_cpu(req->out->type);\n+ switch (type & ~(VIRTIO_BLK_T_OUT | VIRTIO_BLK_T_BARRIER)) {\n+ case VIRTIO_BLK_T_IN: {\n+ ssize_t ret = 0;\n+ bool is_write = type & VIRTIO_BLK_T_OUT;\n+ req->sector_num = 
le64_to_cpu(req->out->sector);\n+ if (is_write) {\n+ ret = vu_blk_writev(req, &elem->out_sg[1], out_num);\n+ } else {\n+ ret = vu_blk_readv(req, &elem->in_sg[0], in_num);\n+ }\n+ if (ret >= 0) {\n+ req->in->status = VIRTIO_BLK_S_OK;\n+ } else {\n+ req->in->status = VIRTIO_BLK_S_IOERR;\n+ }\n+ vu_blk_req_complete(req);\n+ break;\n+ }\n+ case VIRTIO_BLK_T_FLUSH: {\n+ vu_blk_flush(req);\n+ req->in->status = VIRTIO_BLK_S_OK;\n+ vu_blk_req_complete(req);\n+ break;\n+ }\n+ case VIRTIO_BLK_T_GET_ID: {\n+ size_t size = MIN(vu_blk_iov_size(&elem->in_sg[0], in_num),\n+ VIRTIO_BLK_ID_BYTES);\n+ snprintf(elem->in_sg[0].iov_base, size, \"%s\", \"vhost_user_blk\");\n+ req->in->status = VIRTIO_BLK_S_OK;\n+ req->size = elem->in_sg[0].iov_len;\n+ vu_blk_req_complete(req);\n+ break;\n+ }\n+ default: {\n+ req->in->status = VIRTIO_BLK_S_UNSUPP;\n+ vu_blk_req_complete(req);\n+ break;\n+ }\n+ }\n+\n+ return 0;\n+\n+err:\n+ free(elem);\n+ free(req);\n+ return -1;\n+}\n+\n+static void vu_blk_process_vq(VuDev *vu_dev, int idx)\n+{\n+ vhost_blk_dev_t *vdev_blk;\n+ VuVirtq *vq;\n+ int ret;\n+\n+ if ((idx < 0) || (idx >= VHOST_MAX_NR_VIRTQUEUE)) {\n+ fprintf(stderr, \"VQ Index out of range: %d\\n\", idx);\n+ vu_blk_panic_cb(vu_dev, NULL);\n+ return;\n+ }\n+\n+ vdev_blk = container_of(vu_dev, vhost_blk_dev_t, vu_dev);\n+ assert(vdev_blk);\n+\n+ vq = vu_get_queue(vu_dev, idx);\n+ assert(vq);\n+\n+ while (1) {\n+ ret = vu_virtio_blk_process_req(vdev_blk, vq);\n+ if (ret) {\n+ break;\n+ }\n+ }\n+}\n+\n+static void vu_blk_queue_set_started(VuDev *vu_dev, int idx, bool started)\n+{\n+ VuVirtq *vq;\n+\n+ assert(vu_dev);\n+\n+ if ((idx < 0) || (idx >= VHOST_MAX_NR_VIRTQUEUE)) {\n+ fprintf(stderr, \"VQ Index out of range: %d\\n\", idx);\n+ vu_blk_panic_cb(vu_dev, NULL);\n+ return;\n+ }\n+\n+ vq = vu_get_queue(vu_dev, idx);\n+ vu_set_queue_handler(vu_dev, vq, started ? 
vu_blk_process_vq : NULL);\n+}\n+\n+static uint64_t\n+vu_blk_get_features(VuDev *dev)\n+{\n+ return 1ull << VIRTIO_BLK_F_SIZE_MAX |\n+ 1ull << VIRTIO_BLK_F_SEG_MAX |\n+ 1ull << VIRTIO_BLK_F_TOPOLOGY |\n+ 1ull << VIRTIO_BLK_F_BLK_SIZE |\n+ 1ull << VIRTIO_F_VERSION_1 |\n+ 1ull << VHOST_USER_F_PROTOCOL_FEATURES;\n+}\n+\n+static int\n+vu_blk_get_config(VuDev *vu_dev, uint8_t *config, size_t len)\n+{\n+ vhost_blk_dev_t *vdev_blk;\n+\n+ if (len != sizeof(struct virtio_blk_config)) {\n+ return -1;\n+ }\n+ vdev_blk = container_of(vu_dev, vhost_blk_dev_t, vu_dev);\n+ memcpy(config, &vdev_blk->blkcfg, len);\n+\n+ return 0;\n+}\n+\n+static const VuDevIface vu_blk_iface = {\n+ .get_features = vu_blk_get_features,\n+ .queue_set_started = vu_blk_queue_set_started,\n+ .get_config = vu_blk_get_config,\n+};\n+\n+static gboolean vu_blk_vhost_cb(gpointer data)\n+{\n+ VuDev *vu_dev = (VuDev *)data;\n+\n+ assert(vu_dev);\n+\n+ if (!vu_dispatch(vu_dev) != 0) {\n+ fprintf(stderr, \"Error processing vhost message\\n\");\n+ vu_blk_panic_cb(vu_dev, NULL);\n+ return G_SOURCE_REMOVE;\n+ }\n+\n+ return G_SOURCE_CONTINUE;\n+}\n+\n+static int unix_sock_new(char *unix_fn)\n+{\n+ int sock;\n+ struct sockaddr_un un;\n+ size_t len;\n+\n+ assert(unix_fn);\n+\n+ sock = socket(AF_UNIX, SOCK_STREAM, 0);\n+ if (sock <= 0) {\n+ perror(\"socket\");\n+ return -1;\n+ }\n+\n+ un.sun_family = AF_UNIX;\n+ (void)snprintf(un.sun_path, sizeof(un.sun_path), \"%s\", unix_fn);\n+ len = sizeof(un.sun_family) + strlen(un.sun_path);\n+\n+ (void)unlink(unix_fn);\n+ if (bind(sock, (struct sockaddr *)&un, len) < 0) {\n+ perror(\"bind\");\n+ goto fail;\n+ }\n+\n+ if (listen(sock, 1) < 0) {\n+ perror(\"listen\");\n+ goto fail;\n+ }\n+\n+ return sock;\n+\n+fail:\n+ (void)close(sock);\n+\n+ return -1;\n+}\n+\n+static int vdev_blk_run(struct vhost_blk_dev *vdev_blk)\n+{\n+ int cli_sock;\n+ int ret = 0;\n+\n+ assert(vdev_blk);\n+ assert(vdev_blk->server_sock >= 0);\n+ assert(vdev_blk->loop);\n+\n+ cli_sock = 
accept(vdev_blk->server_sock, (void *)0, (void *)0);\n+ if (cli_sock < 0) {\n+ perror(\"accept\");\n+ return -1;\n+ }\n+\n+ vu_init(&vdev_blk->vu_dev,\n+ cli_sock,\n+ vu_blk_panic_cb,\n+ vu_blk_add_watch_cb,\n+ vu_blk_del_watch_cb,\n+ &vu_blk_iface);\n+\n+ if (vu_blk_gsrc_new(vdev_blk, cli_sock, G_IO_IN, NULL, vu_blk_vhost_cb,\n+ &vdev_blk->vu_dev)) {\n+ ret = -1;\n+ goto out;\n+ }\n+\n+ g_main_loop_run(vdev_blk->loop);\n+\n+out:\n+ vu_deinit(&vdev_blk->vu_dev);\n+ return ret;\n+}\n+\n+static void vdev_blk_deinit(struct vhost_blk_dev *vdev_blk)\n+{\n+ if (!vdev_blk) {\n+ return;\n+ }\n+\n+ if (vdev_blk->server_sock >= 0) {\n+ struct sockaddr_storage ss;\n+ socklen_t sslen = sizeof(ss);\n+\n+ if (getsockname(vdev_blk->server_sock, (struct sockaddr *)&ss,\n+ &sslen) == 0) {\n+ struct sockaddr_un *su = (struct sockaddr_un *)&ss;\n+ (void)unlink(su->sun_path);\n+ }\n+\n+ (void)close(vdev_blk->server_sock);\n+ vdev_blk->server_sock = -1;\n+ }\n+\n+ if (vdev_blk->loop) {\n+ g_main_loop_unref(vdev_blk->loop);\n+ vdev_blk->loop = NULL;\n+ }\n+\n+ if (vdev_blk->blk_fd) {\n+ vu_blk_close(vdev_blk->blk_fd);\n+ }\n+}\n+\n+static void\n+vu_blk_initialize_config(int fd, struct virtio_blk_config *config)\n+{\n+ off64_t capacity;\n+\n+ capacity = lseek64(fd, 0, SEEK_END);\n+ config->capacity = capacity >> 9;\n+ config->blk_size = 512;\n+ config->size_max = 65536;\n+ config->seg_max = 128 - 2;\n+ config->min_io_size = 1;\n+ config->opt_io_size = 1;\n+ config->num_queues = 1;\n+}\n+\n+static vhost_blk_dev_t *\n+vdev_blk_new(char *unix_fn, char *blk_file)\n+{\n+ vhost_blk_dev_t *vdev_blk = NULL;\n+\n+ vdev_blk = calloc(1, sizeof(struct vhost_blk_dev));\n+ if (!vdev_blk) {\n+ fprintf(stderr, \"calloc: %s\", strerror(errno));\n+ return NULL;\n+ }\n+\n+ vdev_blk->server_sock = unix_sock_new(unix_fn);\n+ if (vdev_blk->server_sock < 0) {\n+ goto err;\n+ }\n+\n+ vdev_blk->loop = g_main_loop_new(NULL, FALSE);\n+ if (!vdev_blk->loop) {\n+ fprintf(stderr, \"Error creating glib event 
loop\");\n+ goto err;\n+ }\n+\n+ vdev_blk->fdmap = g_tree_new(vu_blk_fdmap_compare);\n+ if (!vdev_blk->fdmap) {\n+ fprintf(stderr, \"Error creating glib tree for fdmap\");\n+ goto err;\n+ }\n+\n+ vdev_blk->blk_fd = vu_blk_open(blk_file);\n+ if (vdev_blk->blk_fd < 0) {\n+ fprintf(stderr, \"Error open block device %s\\n\", blk_file);\n+ goto err;\n+ }\n+ vdev_blk->blk_name = blk_file;\n+\n+ /* fill virtio_blk_config with block parameters */\n+ vu_blk_initialize_config(vdev_blk->blk_fd, &vdev_blk->blkcfg);\n+\n+ return vdev_blk;\n+\n+err:\n+ vdev_blk_deinit(vdev_blk);\n+ free(vdev_blk);\n+\n+ return NULL;\n+}\n+\n+int main(int argc, char **argv)\n+{\n+ int opt;\n+ char *unix_socket = NULL;\n+ char *blk_file = NULL;\n+ vhost_blk_dev_t *vdev_blk = NULL;\n+\n+ while ((opt = getopt(argc, argv, \"b:h:s:\")) != -1) {\n+ switch (opt) {\n+ case 'b':\n+ blk_file = strdup(optarg);\n+ break;\n+ case 's':\n+ unix_socket = strdup(optarg);\n+ break;\n+ case 'h':\n+ default:\n+ printf(\"Usage: %s [-b block device or file, -s UNIX domain socket]\"\n+ \" | [ -h ]\\n\", argv[0]);\n+ break;\n+ }\n+ }\n+\n+ if (!unix_socket || !blk_file) {\n+ printf(\"Usage: %s [-b block device or file, -s UNIX domain socket] |\"\n+ \" [ -h ]\\n\", argv[0]);\n+ return -1;\n+ }\n+\n+ vdev_blk = vdev_blk_new(unix_socket, blk_file);\n+ if (!vdev_blk) {\n+ goto err;\n+ }\n+\n+ if (vdev_blk_run(vdev_blk) != 0) {\n+ goto err;\n+ }\n+\n+err:\n+ if (vdev_blk) {\n+ vdev_blk_deinit(vdev_blk);\n+ free(vdev_blk);\n+ }\n+ if (unix_socket) {\n+ free(unix_socket);\n+ }\n+ if (blk_file) {\n+ free(blk_file);\n+ }\n+\n+ return 0;\n+}\n+\n", "prefixes": [ "v3", "4/4" ] }
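A response like the one above can be reduced to the handful of fields a script usually needs. The following is a minimal sketch that parses such a JSON body offline; the helper name `summarize_patch` and the trimmed `SAMPLE` payload are illustrative, with the sample values copied from the response shown above and most keys omitted.

```python
import json


def summarize_patch(payload: str) -> dict:
    """Extract a few commonly used fields from a patch detail response."""
    patch = json.loads(payload)
    series = patch.get("series") or []
    return {
        "id": patch["id"],
        "name": patch["name"],
        "state": patch["state"],
        "submitter": patch["submitter"]["email"],
        "series_ids": [entry["id"] for entry in series],
        "mbox": patch["mbox"],
    }


# Trimmed copy of the response above (illustrative; most keys omitted).
SAMPLE = json.dumps({
    "id": 812238,
    "name": "[v3,4/4] contrib/vhost-user-blk: introduce a vhost-user-blk sample application",
    "state": "new",
    "submitter": {"id": 71275, "email": "changpeng.liu@intel.com"},
    "series": [{"id": 2447, "version": 3}],
    "mbox": "http://patchwork.ozlabs.org/project/qemu-devel/patch/"
            "1505202976-1784-5-git-send-email-changpeng.liu@intel.com/mbox/",
})

if __name__ == "__main__":
    for key, value in summarize_patch(SAMPLE).items():
        print(f"{key}: {value}")
```

The `mbox` URL is the field to follow when the raw patch is needed for applying with `git am`; the `diff` and `content` fields carry the same material already split into body and diff.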