
[v3,0/3] Add ZynqMP VCU/Allegro DVT H.264 encoder driver

Message ID 20190213175124.3695-1-m.tretter@pengutronix.de

Message

Michael Tretter Feb. 13, 2019, 5:51 p.m. UTC
This is v3 of the series to add support for the Allegro DVT H.264 encoder
found in the EV family of the Xilinx ZynqMP platform.

The devicetree bindings now also include the clocks as documented in PG252
"H.264/H.265 Video Codec Unit v1.2" [0].

Most changes add or remove v4l2 callbacks that were mentioned in the review
comments in order to make the v4l2-compliance test suite happy.

I also updated the firmware loading and device probing to properly handle
failures and correctly reset or stop the MCU.

Still on my list is a check against the documentation of the memory-to-memory
stateful video encoder interface. If there are fixes necessary, I will include
them in a v4.

I did not introduce a separate queue for internal buffers for encoded frames.
I am still considering this split, but I want to have tracing in place before
doing this. The changes will be completely internal to the driver and can be done
later anyway.

Each patch also contains a more detailed changelog.

Michael

[0] https://www.xilinx.com/support/documentation/ip_documentation/vcu/v1_2/pg252-vcu.pdf

v2 -> v3:
- add clocks to devicetree bindings
- fix devicetree binding according to review comments on v2
- add missing v4l2 callbacks
- drop unnecessary v4l2 callbacks
- drop debug module parameter poison_capture_buffers
- check firmware size before loading firmware
- rework error handling

v1 -> v2:
- clean up debug log levels
- fix unused variable in allegro_mbox_init
- fix uninitialized variable in allegro_mbox_write
- fix global module parameters
- fix Kconfig dependencies
- return h264 as default codec for mcu
- implement device reset as documented
- document why irq does not wait for clear
- rename ENCODE_ONE_FRM to ENCODE_FRAME
- allow error codes for mcu_channel_id
- move control handler to channel
- add fw version check
- add support for colorspaces
- enable configuration of H.264 levels
- enable configuration of frame size
- enable configuration of bit rate and CPB size
- enable configuration of GOP size
- rework response handling
- fix missing error handling in allegro_h264_write_sps


Michael Tretter (3):
  media: dt-bindings: media: document allegro-dvt bindings
  [media] allegro: add Allegro DVT video IP core driver
  [media] allegro: add SPS/PPS nal unit writer

 .../devicetree/bindings/media/allegro.txt     |   43 +
 MAINTAINERS                                   |    6 +
 drivers/staging/media/Kconfig                 |    2 +
 drivers/staging/media/Makefile                |    1 +
 drivers/staging/media/allegro-dvt/Kconfig     |   16 +
 drivers/staging/media/allegro-dvt/Makefile    |    6 +
 .../staging/media/allegro-dvt/allegro-core.c  | 2909 +++++++++++++++++
 drivers/staging/media/allegro-dvt/nal-h264.c  | 1278 ++++++++
 drivers/staging/media/allegro-dvt/nal-h264.h  |  188 ++
 9 files changed, 4449 insertions(+)
 create mode 100644 Documentation/devicetree/bindings/media/allegro.txt
 create mode 100644 drivers/staging/media/allegro-dvt/Kconfig
 create mode 100644 drivers/staging/media/allegro-dvt/Makefile
 create mode 100644 drivers/staging/media/allegro-dvt/allegro-core.c
 create mode 100644 drivers/staging/media/allegro-dvt/nal-h264.c
 create mode 100644 drivers/staging/media/allegro-dvt/nal-h264.h

Comments

Michael Tretter Feb. 13, 2019, 5:54 p.m. UTC | #1
On Wed, 13 Feb 2019 18:51:21 +0100, Michael Tretter wrote:
> This is v3 of the series to add support for the Allegro DVT H.264 encoder
> found in the EV family of the Xilinx ZynqMP platform.
> 
> The devicetree bindings now also include the clocks as documented in PG252
> "H.264/H.265 Video Codec Unit v1.2" [0].
> 
> Most changes add or remove v4l2 callbacks that were mentioned in the review
> comments in order to make the v4l2-compliance test suite happy.

And here are the test results:
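(For context, this was a plain streaming run, presumably something like
"v4l2-compliance -d /dev/video4 -s", which would match the 60 captured
buffers reported below.)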

v4l2-compliance SHA: 9e375c965f200538817ab712330a86a123124394, 64 bits

Compliance test for allegro device /dev/video4:

Driver Info:
        Driver name      : allegro
        Card type        : Allegro DVT Video Encoder
        Bus info         : platform:a0009000.al5e
        Driver version   : 5.0.0
        Capabilities     : 0x84208000
                Video Memory-to-Memory
                Streaming
                Extended Pix Format
                Device Capabilities
        Device Caps      : 0x04208000
                Video Memory-to-Memory
                Streaming
                Extended Pix Format
        Detected Stateful Encoder

Required ioctls:
        test VIDIOC_QUERYCAP: OK

Allow for multiple opens:
        test second /dev/video4 open: OK
        test VIDIOC_QUERYCAP: OK
        test VIDIOC_G/S_PRIORITY: OK
        test for unlimited opens: OK

Debug ioctls:
        test VIDIOC_DBG_G/S_REGISTER: OK
        test VIDIOC_LOG_STATUS: OK (Not Supported)

Input ioctls:
        test VIDIOC_G/S_TUNER/ENUM_FREQ_BANDS: OK (Not Supported)
        test VIDIOC_G/S_FREQUENCY: OK (Not Supported)
        test VIDIOC_S_HW_FREQ_SEEK: OK (Not Supported)
        test VIDIOC_ENUMAUDIO: OK (Not Supported)
        test VIDIOC_G/S/ENUMINPUT: OK (Not Supported)
        test VIDIOC_G/S_AUDIO: OK (Not Supported)
        Inputs: 0 Audio Inputs: 0 Tuners: 0

Output ioctls:
        test VIDIOC_G/S_MODULATOR: OK (Not Supported)
        test VIDIOC_G/S_FREQUENCY: OK (Not Supported)
        test VIDIOC_ENUMAUDOUT: OK (Not Supported)
        test VIDIOC_G/S/ENUMOUTPUT: OK (Not Supported)
        test VIDIOC_G/S_AUDOUT: OK (Not Supported)
        Outputs: 0 Audio Outputs: 0 Modulators: 0

Input/Output configuration ioctls:
        test VIDIOC_ENUM/G/S/QUERY_STD: OK (Not Supported)
        test VIDIOC_ENUM/G/S/QUERY_DV_TIMINGS: OK (Not Supported)
        test VIDIOC_DV_TIMINGS_CAP: OK (Not Supported)
        test VIDIOC_G/S_EDID: OK (Not Supported)

Control ioctls:
        test VIDIOC_QUERY_EXT_CTRL/QUERYMENU: OK
        test VIDIOC_QUERYCTRL: OK
        test VIDIOC_G/S_CTRL: OK
        test VIDIOC_G/S/TRY_EXT_CTRLS: OK
        test VIDIOC_(UN)SUBSCRIBE_EVENT/DQEVENT: OK
        test VIDIOC_G/S_JPEGCOMP: OK (Not Supported)
        Standard Controls: 8 Private Controls: 0

Format ioctls:
        test VIDIOC_ENUM_FMT/FRAMESIZES/FRAMEINTERVALS: OK
        test VIDIOC_G/S_PARM: OK (Not Supported)
        test VIDIOC_G_FBUF: OK (Not Supported)
        test VIDIOC_G_FMT: OK
        test VIDIOC_TRY_FMT: OK
        test VIDIOC_S_FMT: OK
        test VIDIOC_G_SLICED_VBI_CAP: OK (Not Supported)
        test Cropping: OK (Not Supported)
        test Composing: OK (Not Supported)
        test Scaling: OK

Codec ioctls:
        test VIDIOC_(TRY_)ENCODER_CMD: OK
        test VIDIOC_G_ENC_INDEX: OK (Not Supported)
        test VIDIOC_(TRY_)DECODER_CMD: OK (Not Supported)

Buffer ioctls:
        test VIDIOC_REQBUFS/CREATE_BUFS/QUERYBUF: OK
        test VIDIOC_EXPBUF: OK
        test Requests: OK (Not Supported)

Test input 0:

Streaming ioctls:
        test read/write: OK (Not Supported)
        test blocking wait: OK
        Video Capture: Captured 60 buffers                
        test MMAP (no poll): OK
        Video Capture: Captured 60 buffers                
        test MMAP (select): OK
        test USERPTR (no poll): OK (Not Supported)
        test USERPTR (select): OK (Not Supported)
        test DMABUF: Cannot test, specify --expbuf-device

Total for allegro device /dev/video4: 50, Succeeded: 50, Failed: 0, Warnings: 0

> 
> I also updated the firmware loading and device probing to properly handle
> failures and correctly reset or stop the MCU.
> 
> Still on my list is a check against the documentation of the memory-to-memory
> stateful video encoder interface. If there are fixes necessary, I will include
> them in a v4.
> 
> I did not introduce a separate queue for internal buffers for encoded frames.
> I am still considering this split, but I want to have tracing in place before
> doing this. The changes will be completely internal to the driver and can be done
> later anyway.
> 
> Each patch also contains a more detailed changelog.
> 
> Michael
Hans Verkuil Feb. 25, 2019, 11:10 a.m. UTC | #2
On 2/13/19 6:51 PM, Michael Tretter wrote:
> Add a V4L2 mem-to-mem driver for Allegro DVT video IP cores as found in
> the EV family of the Xilinx ZynqMP SoC. The Zynq UltraScale+ Device
> Technical Reference Manual uses the term VCU (Video Codec Unit) for the
> encoder, decoder and system integration block.
> 
> This driver takes care of interacting with the MicroBlaze MCU that
> controls the actual IP cores. The IP cores and MCU are integrated in the
> FPGA. The xlnx_vcu driver is responsible for configuring the clocks and
> providing information about the codec configuration.
> 
> The driver currently only supports the H.264 video encoder.
> 
> Signed-off-by: Michael Tretter <m.tretter@pengutronix.de>
> ---

<snip>

> +static void allegro_finish_frame(struct allegro_channel *channel,
> +				 struct mcu_msg_encode_frame_response *msg)
> +{
> +	struct allegro_dev *dev = channel->dev;
> +	struct vb2_v4l2_buffer *src_buf;
> +	struct vb2_v4l2_buffer *dst_buf;
> +	struct {
> +		u32 offset;
> +		u32 size;
> +	} *partition;
> +	enum vb2_buffer_state state = VB2_BUF_STATE_ERROR;
> +
> +	src_buf = v4l2_m2m_src_buf_remove(channel->fh.m2m_ctx);
> +
> +	dst_buf = v4l2_m2m_dst_buf_remove(channel->fh.m2m_ctx);
> +	dst_buf->sequence = channel->csequence++;
> +
> +	if (msg->error_code) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "channel %d: error while encoding frame: %x\n",
> +			 channel->mcu_channel_id, msg->error_code);
> +		goto err;
> +	}
> +
> +	if (msg->partition_table_size != 1) {
> +		v4l2_warn(&dev->v4l2_dev,
> +			  "channel %d: only handling first partition table entry (%d entries)\n",
> +			  channel->mcu_channel_id, msg->partition_table_size);
> +	}
> +
> +	if (msg->partition_table_offset +
> +	    msg->partition_table_size * sizeof(*partition) >
> +	    vb2_plane_size(&dst_buf->vb2_buf, 0)) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "channel %d: partition table outside of dst_buf\n",
> +			 channel->mcu_channel_id);
> +		goto err;
> +	}
> +
> +	partition =
> +	    vb2_plane_vaddr(&dst_buf->vb2_buf, 0) + msg->partition_table_offset;
> +	if (partition->offset + partition->size >
> +	    vb2_plane_size(&dst_buf->vb2_buf, 0)) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "channel %d: encoded frame is outside of dst_buf (offset 0x%x, size 0x%x)\n",
> +			 channel->mcu_channel_id, partition->offset,
> +			 partition->size);
> +		goto err;
> +	}
> +
> +	v4l2_dbg(2, debug, &dev->v4l2_dev,
> +		"channel %d: encoded frame of size %d is at offset 0x%x\n",
> +		channel->mcu_channel_id, partition->size, partition->offset);
> +
> +	/*
> +	 * TODO The partition->offset differs from the ENCODER_STREAM_OFFSET.
> +	 * Does the encoder add any data before its configured offset that we
> +	 * need to handle?
> +	 */
> +
> +	vb2_set_plane_payload(&dst_buf->vb2_buf, 0,
> +			      partition->offset + partition->size);
> +
> +	state = VB2_BUF_STATE_DONE;
> +
> +	dst_buf->vb2_buf.timestamp = src_buf->vb2_buf.timestamp;
> +	dst_buf->field = src_buf->field;
> +
> +	dst_buf->flags &= ~V4L2_BUF_FLAG_TSTAMP_SRC_MASK;
> +	dst_buf->flags |= src_buf->flags & V4L2_BUF_FLAG_TSTAMP_SRC_MASK;
> +	if (src_buf->flags & V4L2_BUF_FLAG_TIMECODE) {
> +		dst_buf->timecode = src_buf->timecode;
> +		dst_buf->flags |= V4L2_BUF_FLAG_TIMECODE;
> +	} else {
> +		dst_buf->flags &= ~V4L2_BUF_FLAG_TIMECODE;
> +	}

Use v4l2_m2m_buf_copy_metadata() instead of copying this manually.
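
For reference, a minimal sketch of that replacement (assuming the helper is
available in the target kernel; passing false leaves the KEYFRAME/PFRAME
flags untouched, so they can still be set from msg->is_idr below):

	/* copies timestamp, field, timecode and the TSTAMP_SRC flags */
	v4l2_m2m_buf_copy_metadata(src_buf, dst_buf, false);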

> +	if (msg->is_idr) {
> +		dst_buf->flags |= V4L2_BUF_FLAG_KEYFRAME;
> +		dst_buf->flags &= ~V4L2_BUF_FLAG_PFRAME;
> +	} else {
> +		dst_buf->flags |= V4L2_BUF_FLAG_PFRAME;
> +		dst_buf->flags &= ~V4L2_BUF_FLAG_KEYFRAME;
> +	}
> +
> +	v4l2_dbg(1, debug, &dev->v4l2_dev,
> +		 "channel %d: encoded frame (%s%s, %d bytes)\n",
> +		 channel->mcu_channel_id,
> +		 msg->is_idr ? "IDR, " : "",
> +		 msg->slice_type == AL_ENC_SLICE_TYPE_I ? "I slice" :
> +		 msg->slice_type == AL_ENC_SLICE_TYPE_P ? "P slice" : "unknown",
> +		 partition->size);
> +
> +err:
> +	v4l2_m2m_buf_done(src_buf, VB2_BUF_STATE_DONE);
> +
> +	if (channel->stop) {
> +		const struct v4l2_event eos_event = {
> +			.type = V4L2_EVENT_EOS
> +		};
> +		dst_buf->flags |= V4L2_BUF_FLAG_LAST;
> +		v4l2_event_queue_fh(&channel->fh, &eos_event);
> +	}
> +
> +	v4l2_m2m_buf_done(dst_buf, state);
> +	v4l2_m2m_job_finish(dev->m2m_dev, channel->fh.m2m_ctx);
> +}
> +
> +static int allegro_handle_init(struct allegro_dev *dev,
> +			       struct mcu_msg_init_reply *msg)
> +{
> +	complete(&dev->init_complete);
> +
> +	return 0;
> +}
> +
> +static int
> +allegro_handle_create_channel(struct allegro_dev *dev,
> +			      struct mcu_msg_create_channel_response *msg)
> +{
> +	struct allegro_channel *channel;
> +	int err = 0;
> +
> +	channel = allegro_find_channel_by_user_id(dev, msg->user_id);
> +	if (IS_ERR(channel)) {
> +		v4l2_warn(&dev->v4l2_dev,
> +			  "received %s for unknown user %d\n",
> +			  msg_type_name(msg->header.type),
> +			  msg->user_id);
> +		return -EINVAL;
> +	}
> +
> +	if (msg->error_code) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "user %d: failed to create channel: error %d",
> +			 channel->user_id, msg->error_code);
> +		return -EINVAL;
> +	}
> +
> +	channel->mcu_channel_id = msg->channel_id;
> +	v4l2_dbg(1, debug, &dev->v4l2_dev,
> +		 "user %d: channel has channel id %d\n",
> +		 channel->user_id, channel->mcu_channel_id);
> +
> +	v4l2_dbg(1, debug, &dev->v4l2_dev,
> +		 "channel %d: intermediate buffers: %d x %d bytes\n",
> +		 channel->mcu_channel_id, msg->int_buffers_count,
> +		 msg->int_buffers_size);
> +	err = allocate_intermediate_buffers(channel, msg->int_buffers_count,
> +					    msg->int_buffers_size);
> +	if (err) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "channel %d: failed to allocate intermediate buffers",
> +			 channel->mcu_channel_id);
> +		goto out;
> +	}
> +	allegro_mcu_push_buffer_intermediate(channel);
> +	if (err) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "channel %d: failed to push reference buffers",
> +			 channel->mcu_channel_id);
> +		goto out;
> +	}
> +
> +	v4l2_dbg(1, debug, &dev->v4l2_dev,
> +		 "channel %d: reference buffers: %d x %d bytes\n",
> +		 channel->mcu_channel_id, msg->rec_buffers_count,
> +		 msg->rec_buffers_size);
> +	err = allocate_reference_buffers(channel, msg->rec_buffers_count,
> +					 msg->rec_buffers_size);
> +	if (err) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "channel %d: failed to allocate reference buffers",
> +			 channel->mcu_channel_id);
> +		goto out;
> +	}
> +	err = allegro_mcu_push_buffer_reference(channel);
> +	if (err) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "channel %d: failed to push reference buffers",
> +			 channel->mcu_channel_id);
> +		goto out;
> +	}
> +
> +	channel->created = true;
> +
> +	/*
> +	 * FIXME Need to send CHANNEL_DESTROY if we fail to allocate the
> +	 * intermediate or reference buffers?
> +	 */
> +out:
> +	complete(&channel->completion);
> +	return err;
> +}
> +
> +static int
> +allegro_handle_destroy_channel(struct allegro_dev *dev,
> +		struct mcu_msg_destroy_channel_response *msg)
> +{
> +	struct allegro_channel *channel;
> +
> +	channel = allegro_find_channel_by_channel_id(dev, msg->channel_id);
> +	if (IS_ERR(channel)) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "received %s for unknown channel %d\n",
> +			 msg_type_name(msg->header.type),
> +			 msg->channel_id);
> +		return -EINVAL;
> +	}
> +
> +	v4l2_dbg(2, debug, &dev->v4l2_dev,
> +			"user %d: vcu destroyed channel %d\n",
> +			channel->user_id, channel->mcu_channel_id);
> +	complete(&channel->completion);
> +
> +	return 0;
> +}
> +
> +static int
> +allegro_handle_encode_frame(struct allegro_dev *dev,
> +			    struct mcu_msg_encode_frame_response *msg)
> +{
> +	struct allegro_channel *channel;
> +
> +	channel = allegro_find_channel_by_channel_id(dev, msg->channel_id);
> +	if (IS_ERR(channel)) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "received %s for unknown channel %d\n",
> +			 msg_type_name(msg->header.type),
> +			 msg->channel_id);
> +		return -EINVAL;
> +	}
> +
> +	allegro_finish_frame(channel, msg);
> +
> +	return 0;
> +}
> +
> +static int allegro_receive_message(struct allegro_dev *dev)
> +{
> +	struct mcu_msg_header *header;
> +	ssize_t size;
> +	int err = 0;
> +
> +	/*
> +	 * FIXME header is struct mcu_msg_header, but we are allocating memory
> +	 * for a struct mcu_msg_encode_one_frm_response, because we only need
> +	 * to parse the header before we can determine the type and size of
> +	 * the actual message and struct mcu_msg_encode_one_frm_response is
> +	 * currently the largest message.
> +	 */
> +	header = kmalloc(sizeof(struct mcu_msg_encode_frame_response),
> +			 GFP_KERNEL);
> +	if (!header)
> +		return -ENOMEM;
> +
> +	size = allegro_mbox_read(dev, &dev->mbox_status, header,
> +				 sizeof(struct mcu_msg_encode_frame_response));
> +	if (size < sizeof(*header)) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "invalid mbox message (%ld): must be at least %lu\n",
> +			 size, sizeof(*header));
> +		err = -EINVAL;
> +		goto out;
> +	}
> +
> +	switch(header->type) {
> +	case MCU_MSG_TYPE_INIT:
> +		err = allegro_handle_init(dev,
> +				(struct mcu_msg_init_reply *) header);
> +		break;
> +	case MCU_MSG_TYPE_CREATE_CHANNEL:
> +		err = allegro_handle_create_channel(dev,
> +				(struct mcu_msg_create_channel_response *)header);
> +		break;
> +	case MCU_MSG_TYPE_DESTROY_CHANNEL:
> +		err = allegro_handle_destroy_channel(dev,
> +				(struct mcu_msg_destroy_channel_response *)header);
> +		break;
> +	case MCU_MSG_TYPE_ENCODE_FRAME:
> +		err = allegro_handle_encode_frame(dev,
> +				(struct mcu_msg_encode_frame_response *)header);
> +		break;
> +	default:
> +		v4l2_warn(&dev->v4l2_dev,
> +			  "%s: unknown message %s\n",
> +			  __func__, msg_type_name(header->type));
> +		err = -EINVAL;
> +		break;
> +	}
> +
> +out:
> +	kfree(header);
> +
> +	return err;
> +}
> +
> +irqreturn_t allegro_hardirq(int irq, void *data)
> +{
> +	struct allegro_dev *dev = data;
> +	unsigned int status;
> +
> +	regmap_read(dev->regmap, AL5_ITC_CPU_IRQ_STA, &status);
> +	if (!(status & AL5_ITC_CPU_IRQ_STA_TRIGGERED))
> +		return IRQ_NONE;
> +
> +	regmap_write(dev->regmap, AL5_ITC_CPU_IRQ_CLR, status);
> +
> +	/*
> +	 * The downstream driver suggests to wait until the clear has
> +	 * propagated through the hardware, i.e.,
> +	 *
> +	 * 	!(read(AL5_ITC_CPU_IRQ_STA) & AL5_ITC_CPU_IRQ_STA_TRIGGERED)
> +	 *
> +	 * I assume that waiting for the interrupt to be cleared is not
> +	 * required and just continue execution.
> +	 */
> +
> +	return IRQ_WAKE_THREAD;
> +}
> +
> +irqreturn_t allegro_irq_thread(int irq, void *data)
> +{
> +	struct allegro_dev *dev = data;
> +
> +	allegro_receive_message(dev);
> +
> +	return IRQ_HANDLED;
> +}
> +
> +static void allegro_copy_firmware(struct allegro_dev *dev,
> +				  const u8 * const buf, size_t size)
> +{
> +	int err = 0;
> +
> +	v4l2_dbg(1, debug, &dev->v4l2_dev,
> +		"copy mcu firmware (%lu B) to SRAM\n", size);
> +	err = regmap_bulk_write(dev->sram, 0x0, buf, size / 4);
> +	if (err)
> +		v4l2_err(&dev->v4l2_dev,
> +			 "failed to copy firmware: %d\n", err);
> +}
> +
> +static void allegro_copy_fw_codec(struct allegro_dev *dev,
> +				  const u8 * const buf, size_t size)
> +{
> +	int err;
> +	dma_addr_t icache_offset, dcache_offset;
> +
> +	/*
> +	 * The downstream allocates 600 KB for the codec firmware to have some
> +	 * extra space for "possible extensions." My tests were fine with
> +	 * allocating just enough memory for the actual firmware, but I am not
> +	 * sure that the firmware really does not use the remaining space.
> +	 */
> +	err = allegro_alloc_buffer(dev, &dev->firmware, size);
> +	if (err) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "failed to allocate %lu bytes for firmware\n", size);
> +		return;
> +	}
> +
> +	v4l2_dbg(1, debug, &dev->v4l2_dev,
> +		"copy codec firmware (%ld B) to phys 0x%llx",
> +		size, dev->firmware.paddr);
> +	memcpy(dev->firmware.vaddr, buf, size);
> +
> +	regmap_write(dev->regmap, AXI_ADDR_OFFSET_IP,
> +		     upper_32_bits(dev->firmware.paddr));
> +
> +	/*
> +	 * TODO I am not sure what the icache and dcache offsets are doing.
> +	 * Copy behavior from downstream driver for now.
> +	 */
> +	icache_offset = dev->firmware.paddr - MCU_CACHE_OFFSET;
> +	v4l2_dbg(2, debug, &dev->v4l2_dev,
> +		"icache_offset: msb = 0x%x, lsb = 0x%x\n",
> +		upper_32_bits(icache_offset), lower_32_bits(icache_offset));
> +	regmap_write(dev->regmap, AL5_ICACHE_ADDR_OFFSET_MSB,
> +		     upper_32_bits(icache_offset));
> +	regmap_write(dev->regmap, AL5_ICACHE_ADDR_OFFSET_LSB,
> +		     lower_32_bits(icache_offset));
> +
> +	dcache_offset =
> +	    (dev->firmware.paddr & 0xffffffff00000000) - MCU_CACHE_OFFSET;
> +	v4l2_dbg(2, debug, &dev->v4l2_dev,
> +		 "dcache_offset: msb = 0x%x, lsb = 0x%x\n",
> +		 upper_32_bits(dcache_offset), lower_32_bits(dcache_offset));
> +	regmap_write(dev->regmap, AL5_DCACHE_ADDR_OFFSET_MSB,
> +		     upper_32_bits(dcache_offset));
> +	regmap_write(dev->regmap, AL5_DCACHE_ADDR_OFFSET_LSB,
> +		     lower_32_bits(dcache_offset));
> +}
> +
> +static void allegro_free_fw_codec(struct allegro_dev *dev)
> +{
> +	allegro_free_buffer(dev, &dev->firmware);
> +}
> +
> +/*
> + * Control functions for the MCU
> + */
> +
> +static int allegro_mcu_enable_interrupts(struct allegro_dev *dev)
> +{
> +	return regmap_write(dev->regmap, AL5_ITC_CPU_IRQ_MSK, BIT(0));
> +}
> +
> +static int allegro_mcu_disable_interrupts(struct allegro_dev *dev)
> +{
> +	return regmap_write(dev->regmap, AL5_ITC_CPU_IRQ_MSK, 0);
> +}
> +
> +static int allegro_mcu_wait_for_sleep(struct allegro_dev *dev)
> +{
> +	unsigned long timeout;
> +	unsigned int status;
> +
> +	timeout = jiffies + msecs_to_jiffies(100);
> +	while (regmap_read(dev->regmap, AL5_MCU_STA, &status) == 0 &&
> +	       status != AL5_MCU_STA_SLEEP) {
> +		if (time_after(jiffies, timeout))
> +			return -ETIMEDOUT;
> +		cpu_relax();
> +	}
> +
> +	return 0;
> +}
> +
> +static int allegro_mcu_start(struct allegro_dev *dev)
> +{
> +	unsigned long timeout;
> +	unsigned int status;
> +	int err;
> +
> +	err = regmap_write(dev->regmap, AL5_MCU_WAKEUP, BIT(0));
> +	if (err)
> +		return err;
> +
> +	timeout = jiffies + msecs_to_jiffies(100);
> +	while (regmap_read(dev->regmap, AL5_MCU_STA, &status) == 0 &&
> +	       status == AL5_MCU_STA_SLEEP) {
> +		if (time_after(jiffies, timeout))
> +			return -ETIMEDOUT;
> +		cpu_relax();
> +	}
> +
> +	err = regmap_write(dev->regmap, AL5_MCU_WAKEUP, 0);
> +	if (err)
> +		return err;
> +
> +	return 0;
> +}
> +
> +static int allegro_mcu_reset(struct allegro_dev *dev)
> +{
> +	int err;
> +
> +	err = regmap_write(dev->regmap,
> +			   AL5_MCU_RESET_MODE, AL5_MCU_RESET_MODE_SLEEP);
> +	if (err < 0)
> +		return err;
> +
> +	err = regmap_write(dev->regmap, AL5_MCU_RESET, AL5_MCU_RESET_SOFT);
> +	if (err < 0)
> +		return err;
> +
> +	return allegro_mcu_wait_for_sleep(dev);
> +}
> +
> +/*
> + * Create the MCU channel
> + *
> + * After the channel has been created, the picture size, format, colorspace
> + * and framerate are fixed. Also the codec, profile, bitrate, etc. cannot be
> + * changed anymore.
> + *
> + * The channel can be created only once. The MCU will accept source buffers
> + * and stream buffers only after a channel has been created.
> + */
> +static int allegro_create_channel(struct allegro_channel *channel)
> +{
> +	struct allegro_dev *dev = channel->dev;
> +	unsigned long timeout;
> +	int err;
> +	enum v4l2_mpeg_video_h264_level min_level;
> +
> +	if (channel->created) {
> +		v4l2_warn(&dev->v4l2_dev,
> +			  "channel already has been created\n");
> +		return 0;
> +	}
> +
> +	channel->user_id = allegro_next_user_id(dev);
> +	if (channel->user_id < 0) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "no free channels available\n");
> +		return -EBUSY;
> +	}
> +	set_bit(channel->user_id, &dev->channel_user_ids);
> +
> +	v4l2_dbg(1, debug, &dev->v4l2_dev,
> +		 "user %d: creating channel (%4.4s, %dx%d@%d) \n",
> +		 channel->user_id,
> +		 (char *)&channel->codec, channel->width, channel->height, 25);
> +
> +	min_level = select_minimum_h264_level(channel->width, channel->height);
> +	if (channel->level < min_level) {
> +		v4l2_warn(&dev->v4l2_dev,
> +			  "user %d: selected Level %s too low: increasing to Level %s\n",
> +			  channel->user_id,
> +			  v4l2_ctrl_get_menu(V4L2_CID_MPEG_VIDEO_H264_LEVEL)[channel->level],
> +			  v4l2_ctrl_get_menu(V4L2_CID_MPEG_VIDEO_H264_LEVEL)[min_level]);
> +		channel->level = min_level;
> +	}
> +
> +	reinit_completion(&channel->completion);
> +	allegro_mcu_send_create_channel(dev, channel);
> +	timeout = wait_for_completion_timeout(&channel->completion,
> +					      msecs_to_jiffies(5000));
> +	if (timeout == 0) {
> +		err = -ETIMEDOUT;
> +		goto err;
> +	}
> +	if (!channel->created) {
> +		/* FIXME Return a proper error code */
> +		err = -1;
> +		goto err;
> +	}
> +
> +	v4l2_dbg(1, debug, &dev->v4l2_dev,
> +		 "channel %d: accepting buffers\n",
> +		 channel->mcu_channel_id);
> +
> +	return 0;
> +
> +err:
> +	clear_bit(channel->user_id, &dev->channel_user_ids);
> +	channel->user_id = -1;
> +	return err;
> +}
> +
> +static void allegro_destroy_channel(struct allegro_channel *channel)
> +{
> +	struct allegro_dev *dev = channel->dev;
> +	unsigned long timeout;
> +
> +	if (!channel->created) {
> +		v4l2_warn(&dev->v4l2_dev,
> +			  "cannot destroy channel: has not been created\n");
> +		return;
> +	}
> +
> +	reinit_completion(&channel->completion);
> +	allegro_mcu_send_destroy_channel(dev, channel);
> +	timeout = wait_for_completion_timeout(&channel->completion,
> +					      msecs_to_jiffies(5000));
> +	if (timeout == 0)
> +		v4l2_warn(&dev->v4l2_dev,
> +			  "timeout while destroying channel %d\n",
> +			  channel->mcu_channel_id);
> +
> +	channel->mcu_channel_id = -1;
> +	channel->created = false;
> +	clear_bit(channel->user_id, &dev->channel_user_ids);
> +	channel->user_id = -1;
> +
> +	destroy_intermediate_buffers(channel);
> +	destroy_reference_buffers(channel);
> +}
> +
> +static void allegro_set_default_params(struct allegro_channel *channel)
> +{
> +	channel->width = ALLEGRO_WIDTH_DEFAULT;
> +	channel->height = ALLEGRO_HEIGHT_DEFAULT;
> +	channel->stride = round_up(channel->width, 32);
> +
> +	channel->colorspace = V4L2_COLORSPACE_REC709;
> +	channel->ycbcr_enc = V4L2_YCBCR_ENC_DEFAULT;
> +	channel->quantization = V4L2_QUANTIZATION_DEFAULT;
> +	channel->xfer_func = V4L2_XFER_FUNC_DEFAULT;
> +
> +	channel->pixelformat = V4L2_PIX_FMT_NV12;
> +	channel->sizeimage_raw = channel->stride * channel->height * 3 / 2;
> +
> +	channel->codec = V4L2_PIX_FMT_H264;
> +	channel->profile = V4L2_MPEG_VIDEO_H264_PROFILE_BASELINE;
> +	channel->level =
> +		select_minimum_h264_level(channel->width, channel->height);
> +	channel->sizeimage_encoded =
> +		estimate_stream_size(channel->width, channel->height);
> +
> +	channel->bitrate_mode = V4L2_MPEG_VIDEO_BITRATE_MODE_CBR;
> +	channel->bitrate = maximum_bitrate(channel->level);
> +	channel->bitrate_peak = maximum_bitrate(channel->level);
> +	channel->cpb_size = maximum_cpb_size(channel->level);
> +	channel->gop_size = ALLEGRO_GOP_SIZE_DEFAULT;
> +}
> +
> +static int allegro_queue_setup(struct vb2_queue *vq,
> +			       unsigned int *nbuffers, unsigned int *nplanes,
> +			       unsigned int sizes[],
> +			       struct device *alloc_devs[])
> +{
> +	struct allegro_channel *channel = vb2_get_drv_priv(vq);
> +	struct allegro_dev *dev = channel->dev;
> +
> +	v4l2_dbg(2, debug, &dev->v4l2_dev,
> +		 "%s: queue setup[%s]: nplanes = %d\n",
> +		 vq->type == V4L2_BUF_TYPE_VIDEO_OUTPUT ? "output" : "capture",
> +		 *nplanes == 0 ? "REQBUFS" : "CREATE_BUFS", *nplanes);
> +
> +	if (*nplanes != 0) {
> +		if (*nplanes != 1)
> +			return -EINVAL;
> +		if (vq->type == V4L2_BUF_TYPE_VIDEO_CAPTURE) {
> +			if (sizes[0] < channel->sizeimage_encoded)
> +				return -EINVAL;
> +
> +		} else {
> +			if (sizes[0] < channel->sizeimage_raw)
> +				return -EINVAL;
> +		}
> +	} else {
> +		*nplanes = 1;
> +		if (vq->type == V4L2_BUF_TYPE_VIDEO_CAPTURE) {
> +			sizes[0] = channel->sizeimage_encoded;
> +		} else {
> +			sizes[0] = channel->sizeimage_raw;
> +		}
> +	}
> +
> +	return 0;
> +}
> +
> +static int allegro_buf_prepare(struct vb2_buffer *vb)
> +{
> +	struct vb2_v4l2_buffer *vbuf = to_vb2_v4l2_buffer(vb);
> +	struct allegro_channel *channel = vb2_get_drv_priv(vb->vb2_queue);
> +	struct allegro_dev *dev = channel->dev;
> +
> +	if (V4L2_TYPE_IS_OUTPUT(vb->vb2_queue->type)) {
> +		if (vbuf->field == V4L2_FIELD_ANY)
> +			vbuf->field = V4L2_FIELD_NONE;
> +		if (vbuf->field != V4L2_FIELD_NONE) {
> +			v4l2_err(&dev->v4l2_dev,
> +				 "channel %d: unsupported field\n",
> +				 channel->mcu_channel_id);
> +			return -EINVAL;
> +		}
> +	}
> +
> +	return 0;
> +}
> +
> +static void allegro_buf_queue(struct vb2_buffer *vb)
> +{
> +	struct allegro_channel *channel = vb2_get_drv_priv(vb->vb2_queue);
> +	struct allegro_dev *dev = channel->dev;
> +	struct vb2_queue *vq = vb->vb2_queue;
> +	struct allegro_buffer al_buf;
> +
> +	if (vq->type == V4L2_BUF_TYPE_VIDEO_CAPTURE) {
> +		al_buf.paddr = vb2_dma_contig_plane_dma_addr(vb, 0);
> +		al_buf.size = vb2_plane_size(vb, 0);
> +
> +		v4l2_dbg(1, debug, &dev->v4l2_dev,
> +			 "channel %d: queuing stream buffer: paddr: 0x%08llx, %ld bytes\n",
> +			 channel->mcu_channel_id, al_buf.paddr, al_buf.size);
> +		allegro_mcu_send_put_stream_buffer(dev, channel, al_buf);
> +	}
> +
> +	v4l2_m2m_buf_queue(channel->fh.m2m_ctx, to_vb2_v4l2_buffer(vb));
> +}
> +
> +static int allegro_start_streaming(struct vb2_queue *q, unsigned int count)
> +{
> +	struct allegro_channel *channel = vb2_get_drv_priv(q);
> +	struct allegro_dev *dev = channel->dev;
> +
> +	v4l2_dbg(2, debug, &dev->v4l2_dev,
> +		 "%s: start streaming\n",
> +		 q->type == V4L2_BUF_TYPE_VIDEO_OUTPUT ? "output" : "capture");
> +
> +	if (q->type == V4L2_BUF_TYPE_VIDEO_OUTPUT) {
> +		channel->osequence = 0;
> +	} else {
> +		channel->csequence = 0;
> +	}
> +
> +	return 0;
> +}
> +
> +static void allegro_stop_streaming(struct vb2_queue *q)
> +{
> +	struct allegro_channel *channel = vb2_get_drv_priv(q);
> +	struct allegro_dev *dev = channel->dev;
> +	struct vb2_v4l2_buffer *buffer;
> +
> +	v4l2_dbg(2, debug, &dev->v4l2_dev,
> +		 "%s: stop streaming\n",
> +		 q->type == V4L2_BUF_TYPE_VIDEO_OUTPUT ? "output" : "capture");
> +
> +	if (q->type == V4L2_BUF_TYPE_VIDEO_OUTPUT) {
> +		while ((buffer = v4l2_m2m_src_buf_remove(channel->fh.m2m_ctx)))
> +			v4l2_m2m_buf_done(buffer, VB2_BUF_STATE_ERROR);
> +	} else if (q->type == V4L2_BUF_TYPE_VIDEO_CAPTURE) {
> +		/*
> +		 * FIXME The channel is created during queue_setup().
> +		 * Therefore, destroying it in stop_streaming might result in
> +		 * a start_streaming() without a channel, which is really bad.
> +		 */

I think this is fixed now; is this an outdated comment?

There are a few FIXMEs and TODOs in this source; it's probably a good idea
to check them and fix them where it makes sense for v4.

> +		allegro_destroy_channel(channel);
> +		while ((buffer = v4l2_m2m_dst_buf_remove(channel->fh.m2m_ctx)))
> +			v4l2_m2m_buf_done(buffer, VB2_BUF_STATE_ERROR);
> +	}
> +}

Regards,

	Hans
Hans Verkuil Feb. 25, 2019, 11:13 a.m. UTC | #3
On 2/13/19 6:51 PM, Michael Tretter wrote:
> The allegro hardware encoder does not write SPS/PPS nal units into the
> encoded video stream. Therefore, we need to write the units in software.
> 
> The implementation follows Rec. ITU-T H.264 (04/2017) to allow conversion
> between a C struct and the RBSP representation of the SPS and PPS nal
> units.
> 
> The allegro driver writes the nal units into the v4l2 capture buffer in
> front of the actual video data which is written at an offset by the IP
> core. The remaining gap is filled with a filler nal unit.
> 
> Signed-off-by: Michael Tretter <m.tretter@pengutronix.de>
> ---
> v2 -> v3:
> none
> 
> v1 -> v2:
> - clean up debug log levels
> - fix missing error handling in allegro_h264_write_sps
> - enable configuration of frame size
> - enable configuration of bit rate and CPB size
> ---
>  drivers/staging/media/allegro-dvt/Makefile    |    4 +-
>  .../staging/media/allegro-dvt/allegro-core.c  |  182 ++-
>  drivers/staging/media/allegro-dvt/nal-h264.c  | 1278 +++++++++++++++++
>  drivers/staging/media/allegro-dvt/nal-h264.h  |  188 +++
>  4 files changed, 1650 insertions(+), 2 deletions(-)
>  create mode 100644 drivers/staging/media/allegro-dvt/nal-h264.c
>  create mode 100644 drivers/staging/media/allegro-dvt/nal-h264.h
> 
> diff --git a/drivers/staging/media/allegro-dvt/Makefile b/drivers/staging/media/allegro-dvt/Makefile
> index bc30addee47f..eee9713c10e3 100644
> --- a/drivers/staging/media/allegro-dvt/Makefile
> +++ b/drivers/staging/media/allegro-dvt/Makefile
> @@ -1,4 +1,6 @@
>  # SPDX-License-Identifier: GPL-2.0
> -allegro-objs := allegro-core.o
> +ccflags-y += -I$(src)
> +
> +allegro-objs := allegro-core.o nal-h264.o
>  
>  obj-$(CONFIG_VIDEO_ALLEGRO_DVT) += allegro.o
> diff --git a/drivers/staging/media/allegro-dvt/allegro-core.c b/drivers/staging/media/allegro-dvt/allegro-core.c
> index d0e1b2a574a6..8fa53c171110 100644
> --- a/drivers/staging/media/allegro-dvt/allegro-core.c
> +++ b/drivers/staging/media/allegro-dvt/allegro-core.c
> @@ -26,6 +26,8 @@
>  #include <media/videobuf2-dma-contig.h>
>  #include <media/videobuf2-v4l2.h>
>  
> +#include "nal-h264.h"
> +
>  /*
>   * PG252 June 6, 2018 (H.264/H.265 Video Codec Unit v1.1) Chapter 3
>   *
> @@ -1281,6 +1283,131 @@ static int allocate_reference_buffers(struct allegro_channel *channel,
>  					 n, PAGE_ALIGN(size));
>  }
>  
> +static ssize_t allegro_h264_write_sps(struct allegro_channel *channel,
> +				      void *dest, size_t n)
> +{
> +	struct allegro_dev *dev = channel->dev;
> +	struct nal_h264_sps *sps;
> +	ssize_t size;
> +	unsigned int size_mb = SIZE_MACROBLOCK;
> +	/* Calculation of crop units in Rec. ITU-T H.264 (04/2017) p. 76 */
> +	unsigned int crop_unit_x = 2;
> +	unsigned int crop_unit_y = 2;
> +
> +	sps = kzalloc(sizeof(*sps), GFP_KERNEL);
> +	if (!sps)
> +		return -ENOMEM;
> +
> +	sps->profile_idc = nal_h264_profile_from_v4l2(channel->profile);
> +	sps->constraint_set0_flag = 0;
> +	sps->constraint_set1_flag = 1;
> +	sps->constraint_set2_flag = 0;
> +	sps->constraint_set3_flag = 0;
> +	sps->constraint_set4_flag = 0;
> +	sps->constraint_set5_flag = 0;
> +	sps->level_idc = nal_h264_level_from_v4l2(channel->level);
> +	sps->seq_parameter_set_id = 0;
> +	sps->log2_max_frame_num_minus4 = 0;
> +	sps->pic_order_cnt_type = 0;
> +	sps->log2_max_pic_order_cnt_lsb_minus4 = 6;
> +	sps->max_num_ref_frames = 3;
> +	sps->gaps_in_frame_num_value_allowed_flag = 0;
> +	sps->pic_width_in_mbs_minus1 =
> +		DIV_ROUND_UP(channel->width, size_mb) - 1;
> +	sps->pic_height_in_map_units_minus1 =
> +		DIV_ROUND_UP(channel->height, size_mb) - 1;
> +	sps->frame_mbs_only_flag = 1;
> +	sps->mb_adaptive_frame_field_flag = 0;
> +	sps->direct_8x8_inference_flag = 1;
> +	sps->frame_cropping_flag =
> +		(channel->width % size_mb) || (channel->height % size_mb);
> +	if (sps->frame_cropping_flag) {
> +		sps->crop_left = 0;
> +		sps->crop_right = (round_up(channel->width, size_mb) - channel->width) / crop_unit_x;
> +		sps->crop_top = 0;
> +		sps->crop_bottom = (round_up(channel->height, size_mb) - channel->height) / crop_unit_y;
> +	}
> +	sps->vui_parameters_present_flag = 1;
> +	sps->vui.aspect_ratio_info_present_flag = 0;
> +	sps->vui.overscan_info_present_flag = 0;
> +	sps->vui.video_signal_type_present_flag = 1;
> +	sps->vui.video_format = 1;
> +	sps->vui.video_full_range_flag = 0;
> +	sps->vui.colour_description_present_flag = 1;
> +	sps->vui.colour_primaries = 5;
> +	sps->vui.transfer_characteristics = 5;
> +	sps->vui.matrix_coefficients = 5;
> +	sps->vui.chroma_loc_info_present_flag = 1;
> +	sps->vui.chroma_sample_loc_type_top_field = 0;
> +	sps->vui.chroma_sample_loc_type_bottom_field = 0;
> +	sps->vui.timing_info_present_flag = 1;
> +	sps->vui.num_units_in_tick = 1;
> +	sps->vui.time_scale = 50;
> +	sps->vui.fixed_frame_rate_flag = 1;
> +	sps->vui.nal_hrd_parameters_present_flag = 0;
> +	sps->vui.vcl_hrd_parameters_present_flag = 1;
> +	sps->vui.vcl_hrd_parameters.cpb_cnt_minus1 = 0;
> +	sps->vui.vcl_hrd_parameters.bit_rate_scale = 0;
> +	sps->vui.vcl_hrd_parameters.cpb_size_scale = 1;
> +	/* See Rec. ITU-T H.264 (04/2017) p. 410 E-53 */
> +	sps->vui.vcl_hrd_parameters.bit_rate_value_minus1[0] =
> +		channel->bitrate_peak / (1 << (6 + sps->vui.vcl_hrd_parameters.bit_rate_scale)) - 1;
> +	/* See Rec. ITU-T H.264 (04/2017) p. 410 E-54 */
> +	sps->vui.vcl_hrd_parameters.cpb_size_value_minus1[0] =
> +		(channel->cpb_size * 1000) / (1 << (4 + sps->vui.vcl_hrd_parameters.cpb_size_scale)) - 1;
> +	sps->vui.vcl_hrd_parameters.cbr_flag[0] = 1;
> +	sps->vui.vcl_hrd_parameters.initial_cpb_removal_delay_length_minus1 = 31;
> +	sps->vui.vcl_hrd_parameters.cpb_removal_delay_length_minus1 = 31;
> +	sps->vui.vcl_hrd_parameters.dpb_output_delay_length_minus1 = 31;
> +	sps->vui.vcl_hrd_parameters.time_offset_length = 0;
> +	sps->vui.low_delay_hrd_flag = 0;
> +	sps->vui.pic_struct_present_flag = 1;
> +	sps->vui.bitstream_restriction_flag = 0;
> +
> +	size = nal_h264_write_sps(&dev->plat_dev->dev, dest, n, sps);
> +
> +	kfree(sps);
> +
> +	return size;
> +}
> +
> +static ssize_t allegro_h264_write_pps(struct allegro_channel *channel,
> +				      void *dest, size_t n)
> +{
> +	struct allegro_dev *dev = channel->dev;
> +	struct nal_h264_pps *pps;
> +	ssize_t size;
> +
> +	pps = kzalloc(sizeof(*pps), GFP_KERNEL);
> +	if (!pps)
> +		return -ENOMEM;
> +
> +	pps->pic_parameter_set_id = 0;
> +	pps->seq_parameter_set_id = 0;
> +	pps->entropy_coding_mode_flag = 0;
> +	pps->bottom_field_pic_order_in_frame_present_flag = 0;
> +	pps->num_slice_groups_minus1 = 0;
> +	pps->num_ref_idx_l0_default_active_minus1 = 2;
> +	pps->num_ref_idx_l1_default_active_minus1 = 2;
> +	pps->weighted_pred_flag = 0;
> +	pps->weighted_bipred_idc = 0;
> +	pps->pic_init_qp_minus26 = 0;
> +	pps->pic_init_qs_minus26 = 0;
> +	pps->chroma_qp_index_offset = 0;
> +	pps->deblocking_filter_control_present_flag = 1;
> +	pps->constrained_intra_pred_flag = 0;
> +	pps->redundant_pic_cnt_present_flag = 0;
> +	pps->transform_8x8_mode_flag = 0;
> +	pps->pic_scaling_matrix_present_flag = 0;
> +	pps->second_chroma_qp_index_offset = 0;
> +
> +	size = nal_h264_write_pps(&dev->plat_dev->dev, dest, n, pps);
> +
> +	kfree(pps);
> +
> +	return size;
> +}
> +
>  static void allegro_finish_frame(struct allegro_channel *channel,
>  				 struct mcu_msg_encode_frame_response *msg)
>  {
> @@ -1292,6 +1419,9 @@ static void allegro_finish_frame(struct allegro_channel *channel,
>  		u32 size;
>  	} *partition;
>  	enum vb2_buffer_state state = VB2_BUF_STATE_ERROR;
> +	char *curr;
> +	ssize_t len;
> +	ssize_t free;
>  
>  	src_buf = v4l2_m2m_src_buf_remove(channel->fh.m2m_ctx);
>  
> @@ -1340,10 +1470,60 @@ static void allegro_finish_frame(struct allegro_channel *channel,
>  	 * Does the encoder add any data before its configured offset that we
>  	 * need to handle?
>  	 */
> -
>  	vb2_set_plane_payload(&dst_buf->vb2_buf, 0,
>  			      partition->offset + partition->size);
>  
> +	curr = vb2_plane_vaddr(&dst_buf->vb2_buf, 0);
> +	free = partition->offset;
> +	if (msg->is_idr) {
> +		len = allegro_h264_write_sps(channel, curr, free);
> +		if (len < 0) {
> +			v4l2_err(&dev->v4l2_dev,
> +				 "not enough space for sequence parameter set: %ld left\n",
> +				 free);
> +			goto err;
> +		}
> +		curr += len;
> +		free -= len;
> +		v4l2_dbg(1, debug, &dev->v4l2_dev,
> +			 "channel %d: wrote %ld byte SPS nal unit\n",
> +			 channel->mcu_channel_id, len);
> +	}
> +
> +	if (msg->slice_type == AL_ENC_SLICE_TYPE_I) {
> +		len = allegro_h264_write_pps(channel, curr, free);
> +		if (len < 0) {
> +			v4l2_err(&dev->v4l2_dev,
> +				 "not enough space for picture parameter set: %ld left\n",
> +				 free);
> +			goto err;
> +		}
> +		curr += len;
> +		free -= len;
> +		v4l2_dbg(1, debug, &dev->v4l2_dev,
> +			 "channel %d: wrote %ld byte PPS nal unit\n",
> +			 channel->mcu_channel_id, len);
> +	}
> +
> +	len = nal_h264_write_filler(&dev->plat_dev->dev, curr, free);
> +	if (len < 0) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "failed to write %ld filler data\n", free);
> +		goto err;
> +	}
> +	curr += len;
> +	free -= len;
> +	v4l2_dbg(2, debug, &dev->v4l2_dev,
> +		 "channel %d: wrote %ld bytes filler nal unit\n",
> +		 channel->mcu_channel_id, len);
> +
> +	if (free != 0) {
> +		v4l2_err(&dev->v4l2_dev,
> +			 "non-VCL NAL units do not fill space until VCL NAL unit: %ld bytes left\n",
> +			 free);
> +		goto err;
> +	}
> +
>  	state = VB2_BUF_STATE_DONE;
>  
>  	dst_buf->vb2_buf.timestamp = src_buf->vb2_buf.timestamp;
> diff --git a/drivers/staging/media/allegro-dvt/nal-h264.c b/drivers/staging/media/allegro-dvt/nal-h264.c
> new file mode 100644
> index 000000000000..83bc98200c1a
> --- /dev/null
> +++ b/drivers/staging/media/allegro-dvt/nal-h264.c
> @@ -0,0 +1,1278 @@
> +// SPDX-License-Identifier: GPL-2.0
> +/*
> + * Copyright (C) 2019 Pengutronix, Michael Tretter <kernel@pengutronix.de>
> + *
> + * Convert NAL units between raw byte sequence payloads (RBSP) and C structs
> + *
> + * The conversion is defined in "ITU-T Rec. H.264 (04/2017) Advanced video
> + * coding for generic audiovisual services". Decoder drivers may use the
> + * parser to parse RBSP from encoded streams and configure the hardware, if
> + * the hardware is not able to parse RBSP itself.  Encoder drivers may use the
> + * generator to generate the RBSP for SPS/PPS nal units and add them to the
> + * encoded stream if the hardware does not generate the units.
> + */
> +
> +#include <linux/kernel.h>
> +#include <linux/types.h>
> +#include <linux/string.h>
> +#include <linux/v4l2-controls.h>
> +
> +#include <linux/device.h>
> +#include <linux/export.h>
> +#include <linux/log2.h>
> +
> +#include <nal-h264.h>
> +
> +struct rbsp {
> +	char *buf;
> +	int size;
> +	int pos;
> +	int num_consecutive_zeros;
> +};
> +
> +int nal_h264_profile_from_v4l2(enum v4l2_mpeg_video_h264_profile profile)
> +{
> +	switch (profile) {
> +	case V4L2_MPEG_VIDEO_H264_PROFILE_BASELINE:
> +		return 66;
> +	case V4L2_MPEG_VIDEO_H264_PROFILE_MAIN:
> +		return 77;
> +	case V4L2_MPEG_VIDEO_H264_PROFILE_EXTENDED:
> +		return 88;
> +	case V4L2_MPEG_VIDEO_H264_PROFILE_HIGH:
> +		return 100;
> +	default:
> +		return -EINVAL;
> +	}
> +}
> +
> +int nal_h264_level_from_v4l2(enum v4l2_mpeg_video_h264_level level)
> +{
> +	switch (level) {
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_1_0:
> +		return 10;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_1B:
> +		return 9;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_1_1:
> +		return 11;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_1_2:
> +		return 12;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_1_3:
> +		return 13;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_2_0:
> +		return 20;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_2_1:
> +		return 21;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_2_2:
> +		return 22;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_3_0:
> +		return 30;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_3_1:
> +		return 31;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_3_2:
> +		return 32;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_4_0:
> +		return 40;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_4_1:
> +		return 41;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_4_2:
> +		return 42;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_5_0:
> +		return 50;
> +	case V4L2_MPEG_VIDEO_H264_LEVEL_5_1:
> +		return 51;
> +	default:
> +		return -EINVAL;
> +	}
> +}
> +
> +static int rbsp_read_bits(struct rbsp *rbsp, int num, int *val);
> +static int rbsp_write_bits(struct rbsp *rbsp, int num, int val);
> +
> +static int add_emulation_prevention_three_byte(struct rbsp *rbsp)
> +{
> +	rbsp->num_consecutive_zeros = 0;
> +	/*
> +	 * We are not actually the emulation_prevention_three_byte, but the 2
> +	 * one bits of the byte and the 6 zero bits of the next byte.
> +	 * Therefore, the discarded byte shifted by 6 bits.
> +	 */
> +	rbsp_write_bits(rbsp, 8, (0x3 << 6));
> +
> +	return 0;
> +}
> +
> +static int discard_emulation_prevention_three_byte(struct rbsp *rbsp)
> +{
> +	unsigned int tmp = 0;
> +
> +	rbsp->num_consecutive_zeros = 0;
> +	/*
> +	 * We are not actually discarding the emulation_prevention_three_byte,
> +	 * but the 2 one bits of the byte and the 6 zero bits of the next
> +	 * byte. Therefore, the discarded byte shifted by 6 bits.
> +	 */
> +	rbsp_read_bits(rbsp, 8, &tmp);
> +	if (tmp != (0x3 << 6))
> +		return -EINVAL;
> +
> +	return 0;
> +}
> +
> +static inline int rbsp_read_bit(struct rbsp *rbsp)
> +{
> +	int shift;
> +	int ofs;
> +	int bit;
> +	int err;
> +
> +	if (rbsp->num_consecutive_zeros == 22) {
> +		err = discard_emulation_prevention_three_byte(rbsp);
> +		if (err)
> +			return err;
> +	}
> +
> +	shift = 7 - (rbsp->pos % 8);
> +	ofs = rbsp->pos++ / 8;
> +
> +	if (ofs >= rbsp->size)
> +		return -EINVAL;
> +
> +	bit = (rbsp->buf[ofs] >> shift) & 1;
> +
> +	/*
> +	 * Counting zeros for the emulation_prevention_three_byte only starts
> +	 * at byte boundaries.
> +	 */
> +	if (bit == 1 ||
> +	    (rbsp->num_consecutive_zeros < 7 && (rbsp->pos % 8 == 0)))
> +		rbsp->num_consecutive_zeros = 0;
> +	else
> +		rbsp->num_consecutive_zeros++;
> +
> +	return bit;
> +}
> +
> +static inline int rbsp_write_bit(struct rbsp *rbsp, int bit)
> +{
> +	int shift;
> +	int ofs;
> +
> +	if (rbsp->num_consecutive_zeros == 22)
> +		add_emulation_prevention_three_byte(rbsp);
> +
> +	shift = 7 - (rbsp->pos % 8);
> +	ofs = rbsp->pos++ / 8;
> +
> +	if (ofs >= rbsp->size)
> +		return -EINVAL;
> +
> +	rbsp->buf[ofs] &= ~(1 << shift);
> +	rbsp->buf[ofs] |= bit << shift;
> +
> +	/*
> +	 * Counting zeros for the emulation_prevention_three_byte only starts
> +	 * at byte boundaries.
> +	 */
> +	if (bit == 1 ||
> +	    (rbsp->num_consecutive_zeros < 7 && (rbsp->pos % 8 == 0))) {
> +		rbsp->num_consecutive_zeros = 0;
> +	} else {
> +		rbsp->num_consecutive_zeros++;
> +	}
> +
> +	return 0;
> +}
> +
> +static inline int rbsp_read_bits(struct rbsp *rbsp, int num, int *val)
> +{
> +	int i, ret;
> +	int tmp = 0;
> +
> +	if (num > 32)
> +		return -EINVAL;
> +
> +	for (i = 0; i < num; i++) {
> +		ret = rbsp_read_bit(rbsp);
> +		if (ret < 0)
> +			return ret;
> +		tmp |= ret << (num - i - 1);
> +	}
> +
> +	if (val)
> +		*val = tmp;
> +
> +	return 0;
> +}
> +
> +static int rbsp_write_bits(struct rbsp *rbsp, int num, int value)
> +{
> +	int ret;
> +
> +	while (num--) {
> +		ret = rbsp_write_bit(rbsp, (value >> num) & 1);
> +		if (ret)
> +			return ret;
> +	}
> +
> +	return 0;
> +}
> +
> +static int rbsp_read_uev(struct rbsp *rbsp, unsigned int *val)
> +{
> +	int leading_zero_bits = 0;
> +	unsigned int tmp = 0;
> +	int ret;
> +
> +	while ((ret = rbsp_read_bit(rbsp)) == 0)
> +		leading_zero_bits++;
> +	if (ret < 0)
> +		return ret;
> +
> +	if (leading_zero_bits > 0) {
> +		ret = rbsp_read_bits(rbsp, leading_zero_bits, &tmp);
> +		if (ret)
> +			return ret;
> +	}
> +
> +	if (val)
> +		*val = (1 << leading_zero_bits) - 1 + tmp;
> +
> +	return 0;
> +}
> +
> +static int rbsp_write_uev(struct rbsp *rbsp, unsigned int value)
> +{
> +	int i;
> +	int ret;
> +	int tmp = value + 1;
> +	int leading_zero_bits = fls(tmp) - 1;
> +
> +	for (i = 0; i < leading_zero_bits; i++) {
> +		ret = rbsp_write_bit(rbsp, 0);
> +		if (ret)
> +			return ret;
> +	}
> +
> +	return rbsp_write_bits(rbsp, leading_zero_bits + 1, tmp);
> +}
> +
> +static int rbsp_read_sev(struct rbsp *rbsp, int *val)
> +{
> +	unsigned int tmp;
> +	int ret;
> +
> +	ret = rbsp_read_uev(rbsp, &tmp);
> +	if (ret)
> +		return ret;
> +
> +	if (val) {
> +		if (tmp & 1)
> +			*val = (tmp + 1) / 2;
> +		else
> +			*val = -(tmp / 2);
> +	}
> +
> +	return 0;
> +}
> +
> +static int rbsp_write_sev(struct rbsp *rbsp, int val)
> +{
> +	unsigned int tmp;
> +
> +	if (val > 0)
> +		tmp = (2 * val) | 1;
> +	else
> +		tmp = -2 * val;
> +
> +	return rbsp_write_uev(rbsp, tmp);
> +}
> +
> +#define READ_BIT(field)						\
> +	do {							\
> +		int ret = rbsp_read_bit(rbsp);			\
> +		if (ret < 0)					\
> +			return ret;				\
> +		s->field = ret;					\
> +	} while (0)
> +
> +#define READ_BITS(num, field)					\
> +	do {							\
> +		int val;					\
> +		int ret = rbsp_read_bits(rbsp, (num), &val);	\
> +		if (ret)					\
> +			return ret;				\
> +		s->field = val;					\
> +	} while (0)
> +
> +#define READ_UEV(field)						\
> +	do {							\
> +		int ret = rbsp_read_uev(rbsp, &s->field);	\
> +		if (ret)					\
> +			return ret;				\
> +	} while (0)
> +
> +#define READ_SEV(field)						\
> +	do {							\
> +		int ret = rbsp_read_sev(rbsp, &s->field);	\
> +		if (ret)					\
> +			return ret;				\
> +	} while (0)
> +
> +#define WRITE_BIT(field)						\
> +	do {								\
> +		int ret = rbsp_write_bit(rbsp, s->field);		\
> +		if (ret < 0)						\
> +			return ret;					\
> +	} while (0)
> +
> +#define WRITE_BITS(num, field)						\
> +	do {								\
> +		int ret = rbsp_write_bits(rbsp, (num), s->field);	\
> +		if (ret)						\
> +			return ret;					\
> +	} while (0)
> +
> +#define WRITE_UEV(field)						\
> +	do {								\
> +		int ret = rbsp_write_uev(rbsp, s->field);		\
> +		if (ret)						\
> +			return ret;					\
> +	} while (0)
> +
> +#define WRITE_SEV(field)						\
> +	do {								\
> +		int ret = rbsp_write_sev(rbsp, s->field);		\
> +		if (ret)						\
> +			return ret;					\
> +	} while (0)
> +
> +#define PRINT_BIT(field)						\
> +		dev_dbg(dev, "field: %u\n", s->field)			\
> +
> +#define PRINT_BITS(num, field)						\
> +		dev_dbg(dev, "field: %u\n", s->field)			\
> +
> +#define PRINT_UEV(field)						\
> +		dev_dbg(dev, "field: %u\n", s->field)			\
> +
> +#define PRINT_SEV(field)						\
> +		dev_dbg(dev, "field: %d\n", s->field)			\
> +
> +static int nal_h264_write_trailing_bits(const struct device *dev,
> +					struct rbsp *rbsp)
> +{
> +	rbsp_write_bit(rbsp, 1);
> +	while (rbsp->pos % 8)
> +		rbsp_write_bit(rbsp, 0);
> +
> +	return 0;
> +}
> +
> +static int nal_h264_write_hrd_parameters(const struct device *dev,
> +					 struct rbsp *rbsp,
> +					 struct nal_h264_hrd_parameters *hrd)
> +{
> +	struct nal_h264_hrd_parameters *s = hrd;
> +	int i;
> +
> +	WRITE_UEV(cpb_cnt_minus1);
> +	WRITE_BITS(4, bit_rate_scale);
> +	WRITE_BITS(4, cpb_size_scale);
> +
> +	for (i = 0; i <= hrd->cpb_cnt_minus1; i++) {
> +		WRITE_UEV(bit_rate_value_minus1[i]);
> +		WRITE_UEV(cpb_size_value_minus1[i]);
> +		WRITE_BIT(cbr_flag[i]);
> +	}
> +
> +	WRITE_BITS(5, initial_cpb_removal_delay_length_minus1);
> +	WRITE_BITS(5, cpb_removal_delay_length_minus1);
> +	WRITE_BITS(5, dpb_output_delay_length_minus1);
> +	WRITE_BITS(5, time_offset_length);
> +
> +	return 0;
> +}
> +
> +static int nal_h264_read_hrd_parameters(const struct device *dev,
> +					struct rbsp *rbsp,
> +					struct nal_h264_hrd_parameters *hrd)
> +{
> +	struct nal_h264_hrd_parameters *s = hrd;
> +	unsigned int i;
> +
> +	READ_UEV(cpb_cnt_minus1);
> +	READ_BITS(4, bit_rate_scale);
> +	READ_BITS(4, cpb_size_scale);
> +
> +	for (i = 0; i <= hrd->cpb_cnt_minus1; i++) {
> +		READ_UEV(bit_rate_value_minus1[i]);
> +		READ_UEV(cpb_size_value_minus1[i]);
> +		READ_BIT(cbr_flag[i]);
> +	}
> +
> +	READ_BITS(5, initial_cpb_removal_delay_length_minus1);
> +	READ_BITS(5, cpb_removal_delay_length_minus1);
> +	READ_BITS(5, dpb_output_delay_length_minus1);
> +	READ_BITS(5, time_offset_length);
> +
> +	return 0;
> +}
> +
> +static void nal_h264_print_hrd_parameters(const struct device *dev,
> +					  struct nal_h264_hrd_parameters *hrd)
> +{
> +	struct nal_h264_hrd_parameters *s = hrd;
> +	unsigned int i;
> +
> +	if (!hrd)
> +		return;
> +
> +	PRINT_UEV(cpb_cnt_minus1);
> +	PRINT_BITS(4, bit_rate_scale);
> +	PRINT_BITS(4, cpb_size_scale);
> +
> +	for (i = 0; i <= s->cpb_cnt_minus1; i++) {
> +		PRINT_UEV(bit_rate_value_minus1[i]);
> +		PRINT_UEV(cpb_size_value_minus1[i]);
> +		PRINT_BIT(cbr_flag[i]);
> +	}
> +
> +	PRINT_BITS(5, initial_cpb_removal_delay_length_minus1);
> +	PRINT_BITS(5, cpb_removal_delay_length_minus1);
> +	PRINT_BITS(5, dpb_output_delay_length_minus1);
> +	PRINT_BITS(5, time_offset_length);
> +}
> +
> +static int nal_h264_read_vui_parameters(const struct device *dev,
> +					struct rbsp *rbsp,
> +					struct nal_h264_vui_parameters *vui)
> +{
> +	struct nal_h264_vui_parameters *s = vui;
> +	int err;
> +
> +	READ_BIT(aspect_ratio_info_present_flag);
> +	if (vui->aspect_ratio_info_present_flag) {
> +		READ_BITS(8, aspect_ratio_idc);
> +		if (vui->aspect_ratio_idc == 255) {
> +			READ_BITS(16, sar_width);
> +			READ_BITS(16, sar_height);
> +		}
> +	}
> +
> +	READ_BIT(overscan_info_present_flag);
> +	if (vui->overscan_info_present_flag)
> +		READ_BIT(overscan_appropriate_flag);
> +
> +	READ_BIT(video_signal_type_present_flag);
> +	if (vui->video_signal_type_present_flag) {
> +		READ_BITS(3, video_format);
> +		READ_BIT(video_full_range_flag);
> +		READ_BIT(colour_description_present_flag);
> +
> +		if (vui->colour_description_present_flag) {
> +			READ_BITS(8, colour_primaries);
> +			READ_BITS(8, transfer_characteristics);
> +			READ_BITS(8, matrix_coefficients);
> +		}
> +	}
> +
> +	READ_BIT(chroma_loc_info_present_flag);
> +	if (vui->chroma_loc_info_present_flag) {
> +		READ_UEV(chroma_sample_loc_type_top_field);
> +		READ_UEV(chroma_sample_loc_type_bottom_field);
> +	}
> +
> +	READ_BIT(timing_info_present_flag);
> +	if (vui->timing_info_present_flag) {
> +		READ_BITS(32, num_units_in_tick);
> +		READ_BITS(32, time_scale);
> +		READ_BIT(fixed_frame_rate_flag);
> +	}
> +
> +	READ_BIT(nal_hrd_parameters_present_flag);
> +	if (vui->nal_hrd_parameters_present_flag) {
> +		err = nal_h264_read_hrd_parameters(dev, rbsp,
> +						   &vui->nal_hrd_parameters);
> +		if (err)
> +			return err;
> +	}
> +
> +	READ_BIT(vcl_hrd_parameters_present_flag);
> +	if (vui->vcl_hrd_parameters_present_flag) {
> +		err = nal_h264_read_hrd_parameters(dev, rbsp,
> +						   &vui->vcl_hrd_parameters);
> +		if (err)
> +			return err;
> +	}
> +
> +	if (vui->nal_hrd_parameters_present_flag ||
> +	    vui->vcl_hrd_parameters_present_flag)
> +		READ_BIT(low_delay_hrd_flag);
> +
> +	READ_BIT(pic_struct_present_flag);
> +
> +	READ_BIT(bitstream_restriction_flag);
> +	if (vui->bitstream_restriction_flag) {
> +		READ_BIT(motion_vectors_over_pic_boundaries_flag);
> +		READ_UEV(max_bytes_per_pic_denom);
> +		READ_UEV(max_bits_per_mb_denom);
> +		READ_UEV(log2_max_mv_length_horizontal);
> +		READ_UEV(log21_max_mv_length_vertical);
> +		READ_UEV(max_num_reorder_frames);
> +		READ_UEV(max_dec_frame_buffering);
> +	}
> +
> +	return 0;
> +}
> +
> +static ssize_t nal_h264_write_vui_parameters(const struct device *dev,
> +					     struct rbsp *rbsp,
> +					     struct nal_h264_vui_parameters *vui)
> +{
> +	struct nal_h264_vui_parameters *s = vui;
> +	int err;
> +
> +	WRITE_BIT(aspect_ratio_info_present_flag);
> +	if (vui->aspect_ratio_info_present_flag) {
> +		WRITE_BITS(8, aspect_ratio_idc);
> +		if (vui->aspect_ratio_idc == 255) {
> +			WRITE_BITS(16, sar_width);
> +			WRITE_BITS(16, sar_height);
> +		}
> +	}
> +
> +	WRITE_BIT(overscan_info_present_flag);
> +	if (vui->overscan_info_present_flag)
> +		WRITE_BIT(overscan_appropriate_flag);
> +
> +	WRITE_BIT(video_signal_type_present_flag);
> +	if (vui->video_signal_type_present_flag) {
> +		WRITE_BITS(3, video_format);
> +		WRITE_BIT(video_full_range_flag);
> +		WRITE_BIT(colour_description_present_flag);
> +
> +		if (vui->colour_description_present_flag) {
> +			WRITE_BITS(8, colour_primaries);
> +			WRITE_BITS(8, transfer_characteristics);
> +			WRITE_BITS(8, matrix_coefficients);
> +		}
> +	}
> +
> +	WRITE_BIT(chroma_loc_info_present_flag);
> +	if (vui->chroma_loc_info_present_flag) {
> +		WRITE_UEV(chroma_sample_loc_type_top_field);
> +		WRITE_UEV(chroma_sample_loc_type_bottom_field);
> +	}
> +
> +	WRITE_BIT(timing_info_present_flag);
> +	if (vui->timing_info_present_flag) {
> +		WRITE_BITS(32, num_units_in_tick);
> +		WRITE_BITS(32, time_scale);
> +		WRITE_BIT(fixed_frame_rate_flag);
> +	}
> +
> +	WRITE_BIT(nal_hrd_parameters_present_flag);
> +	if (vui->nal_hrd_parameters_present_flag) {
> +		err = nal_h264_write_hrd_parameters(dev, rbsp,
> +						    &vui->nal_hrd_parameters);
> +		if (err)
> +			return err;
> +	}
> +
> +	WRITE_BIT(vcl_hrd_parameters_present_flag);
> +	if (vui->vcl_hrd_parameters_present_flag) {
> +		err = nal_h264_write_hrd_parameters(dev, rbsp,
> +						    &vui->vcl_hrd_parameters);
> +		if (err)
> +			return err;
> +	}
> +
> +	if (vui->nal_hrd_parameters_present_flag ||
> +	    vui->vcl_hrd_parameters_present_flag)
> +		WRITE_BIT(low_delay_hrd_flag);
> +
> +	WRITE_BIT(pic_struct_present_flag);
> +
> +	WRITE_BIT(bitstream_restriction_flag);
> +	if (vui->bitstream_restriction_flag) {
> +		WRITE_BIT(motion_vectors_over_pic_boundaries_flag);
> +		WRITE_UEV(max_bytes_per_pic_denom);
> +		WRITE_UEV(max_bits_per_mb_denom);
> +		WRITE_UEV(log2_max_mv_length_horizontal);
> +		WRITE_UEV(log2_max_mv_length_vertical);
> +		WRITE_UEV(max_num_reorder_frames);
> +		WRITE_UEV(max_dec_frame_buffering);
> +	}
> +
> +	return 0;
> +}
> +
> +static void nal_h264_print_vui_parameters(const struct device *dev,
> +					  struct nal_h264_vui_parameters *vui)
> +{
> +	struct nal_h264_vui_parameters *s = vui;
> +
> +	if (!vui)
> +		return;
> +
> +	PRINT_BIT(aspect_ratio_info_present_flag);
> +	if (vui->aspect_ratio_info_present_flag) {
> +		PRINT_BITS(8, aspect_ratio_idc);
> +		if (vui->aspect_ratio_idc == 255) {
> +			PRINT_BITS(16, sar_width);
> +			PRINT_BITS(16, sar_height);
> +		}
> +	}
> +
> +	PRINT_BIT(overscan_info_present_flag);
> +	if (vui->overscan_info_present_flag)
> +		PRINT_BIT(overscan_appropriate_flag);
> +
> +	PRINT_BIT(video_signal_type_present_flag);
> +	if (vui->video_signal_type_present_flag) {
> +		PRINT_BITS(3, video_format);
> +		PRINT_BIT(video_full_range_flag);
> +		PRINT_BIT(colour_description_present_flag);
> +
> +		if (vui->colour_description_present_flag) {
> +			PRINT_BITS(8, colour_primaries);
> +			PRINT_BITS(8, transfer_characteristics);
> +			PRINT_BITS(8, matrix_coefficients);
> +		}
> +	}
> +
> +	PRINT_BIT(chroma_loc_info_present_flag);
> +	if (vui->chroma_loc_info_present_flag) {
> +		PRINT_UEV(chroma_sample_loc_type_top_field);
> +		PRINT_UEV(chroma_sample_loc_type_bottom_field);
> +	}
> +
> +	PRINT_BIT(timing_info_present_flag);
> +	if (vui->timing_info_present_flag) {
> +		PRINT_BITS(32, num_units_in_tick);
> +		PRINT_BITS(32, time_scale);
> +		PRINT_BIT(fixed_frame_rate_flag);
> +	}
> +
> +	PRINT_BIT(nal_hrd_parameters_present_flag);
> +	if (vui->nal_hrd_parameters_present_flag)
> +		nal_h264_print_hrd_parameters(dev, &vui->nal_hrd_parameters);
> +
> +	PRINT_BIT(vcl_hrd_parameters_present_flag);
> +	if (vui->vcl_hrd_parameters_present_flag)
> +		nal_h264_print_hrd_parameters(dev, &vui->vcl_hrd_parameters);
> +
> +	if (vui->nal_hrd_parameters_present_flag ||
> +	    vui->vcl_hrd_parameters_present_flag)
> +		PRINT_BIT(low_delay_hrd_flag);
> +
> +	PRINT_BIT(pic_struct_present_flag);
> +
> +	PRINT_BIT(bitstream_restriction_flag);
> +	if (vui->bitstream_restriction_flag) {
> +		PRINT_BIT(motion_vectors_over_pic_boundaries_flag);
> +		PRINT_UEV(max_bytes_per_pic_denom);
> +		PRINT_UEV(max_bits_per_mb_denom);
> +		PRINT_UEV(log2_max_mv_length_horizontal);
> +		PRINT_UEV(log2_max_mv_length_vertical);
> +		PRINT_UEV(max_num_reorder_frames);
> +		PRINT_UEV(max_dec_frame_buffering);
> +	}
> +}
> +
> +static int nal_h264_rbsp_write_sps(const struct device *dev,
> +				   struct rbsp *rbsp, struct nal_h264_sps *sps)
> +{
> +	struct nal_h264_sps *s = sps;
> +	unsigned int i;
> +	int err;
> +
> +	if (rbsp->size < 3)
> +		return -EINVAL;
> +
> +	WRITE_BITS(8, profile_idc);
> +	WRITE_BIT(constraint_set0_flag);
> +	WRITE_BIT(constraint_set1_flag);
> +	WRITE_BIT(constraint_set2_flag);
> +	WRITE_BIT(constraint_set3_flag);
> +	WRITE_BIT(constraint_set4_flag);
> +	WRITE_BIT(constraint_set5_flag);
> +	WRITE_BITS(2, reserved_zero_2bits);
> +	WRITE_BITS(8, level_idc);
> +
> +	WRITE_UEV(seq_parameter_set_id);
> +
> +	if (sps->profile_idc == 100 || sps->profile_idc == 110 ||
> +	    sps->profile_idc == 122 || sps->profile_idc == 244 ||
> +	    sps->profile_idc == 44 || sps->profile_idc == 83 ||
> +	    sps->profile_idc == 86 || sps->profile_idc == 118 ||
> +	    sps->profile_idc == 128 || sps->profile_idc == 138 ||
> +	    sps->profile_idc == 139 || sps->profile_idc == 134 ||
> +	    sps->profile_idc == 135) {
> +		WRITE_UEV(chroma_format_idc);
> +
> +		if (sps->chroma_format_idc == 3)
> +			WRITE_BIT(separate_colour_plane_flag);
> +
> +		WRITE_UEV(bit_depth_luma_minus8);
> +		WRITE_UEV(bit_depth_chroma_minus8);
> +		WRITE_BIT(qpprime_y_zero_transform_bypass_flag);
> +		WRITE_BIT(seq_scaling_matrix_present_flag);
> +
> +		if (sps->seq_scaling_matrix_present_flag) {
> +			dev_err(dev,
> +				"%s: Handling scaling matrix not supported\n",
> +				__func__);
> +			return -EINVAL;
> +		}
> +	}
> +
> +	WRITE_UEV(log2_max_frame_num_minus4);
> +
> +	WRITE_UEV(pic_order_cnt_type);
> +	if (sps->pic_order_cnt_type == 0) {
> +		WRITE_UEV(log2_max_pic_order_cnt_lsb_minus4);
> +	} else if (sps->pic_order_cnt_type == 1) {
> +		WRITE_BIT(delta_pic_order_always_zero_flag);
> +		WRITE_SEV(offset_for_non_ref_pic);
> +		WRITE_SEV(offset_for_top_to_bottom_field);
> +
> +		WRITE_UEV(num_ref_frames_in_pic_order_cnt_cycle);
> +		for (i = 0; i < sps->num_ref_frames_in_pic_order_cnt_cycle; i++)
> +			WRITE_SEV(offset_for_ref_frame[i]);
> +	} else {
> +		dev_err(dev,
> +			"%s: Invalid pic_order_cnt_type %u\n", __func__,
> +			sps->pic_order_cnt_type);
> +		return -EINVAL;
> +	}
> +
> +	WRITE_UEV(max_num_ref_frames);
> +	WRITE_BIT(gaps_in_frame_num_value_allowed_flag);
> +	WRITE_UEV(pic_width_in_mbs_minus1);
> +	WRITE_UEV(pic_height_in_map_units_minus1);
> +
> +	WRITE_BIT(frame_mbs_only_flag);
> +	if (!sps->frame_mbs_only_flag)
> +		WRITE_BIT(mb_adaptive_frame_field_flag);
> +
> +	WRITE_BIT(direct_8x8_inference_flag);
> +
> +	WRITE_BIT(frame_cropping_flag);
> +	if (sps->frame_cropping_flag) {
> +		WRITE_UEV(crop_left);
> +		WRITE_UEV(crop_right);
> +		WRITE_UEV(crop_top);
> +		WRITE_UEV(crop_bottom);
> +	}
> +
> +	WRITE_BIT(vui_parameters_present_flag);
> +	if (sps->vui_parameters_present_flag) {
> +		err = nal_h264_write_vui_parameters(dev, rbsp, &sps->vui);
> +		if (err)
> +			return err;
> +	}
> +
> +	return 0;
> +}
> +
> +static int nal_h264_rbsp_read_sps(const struct device *dev,
> +				  struct rbsp *rbsp, struct nal_h264_sps *sps)
> +{
> +	struct nal_h264_sps *s = sps;
> +	unsigned int i;
> +	int err;
> +
> +	if (rbsp->size < 3)
> +		return -EINVAL;
> +
> +	READ_BITS(8, profile_idc);
> +	READ_BIT(constraint_set0_flag);
> +	READ_BIT(constraint_set1_flag);
> +	READ_BIT(constraint_set2_flag);
> +	READ_BIT(constraint_set3_flag);
> +	READ_BIT(constraint_set4_flag);
> +	READ_BIT(constraint_set5_flag);
> +	READ_BITS(2, reserved_zero_2bits);
> +	READ_BITS(8, level_idc);
> +
> +	READ_UEV(seq_parameter_set_id);
> +
> +	if (sps->profile_idc == 100 || sps->profile_idc == 110 ||
> +	    sps->profile_idc == 122 || sps->profile_idc == 244 ||
> +	    sps->profile_idc == 44 || sps->profile_idc == 83 ||
> +	    sps->profile_idc == 86 || sps->profile_idc == 118 ||
> +	    sps->profile_idc == 128 || sps->profile_idc == 138 ||
> +	    sps->profile_idc == 139 || sps->profile_idc == 134 ||
> +	    sps->profile_idc == 135) {
> +		READ_UEV(chroma_format_idc);
> +
> +		if (sps->chroma_format_idc == 3)
> +			READ_BIT(separate_colour_plane_flag);
> +
> +		READ_UEV(bit_depth_luma_minus8);
> +		READ_UEV(bit_depth_chroma_minus8);
> +		READ_BIT(qpprime_y_zero_transform_bypass_flag);
> +		READ_BIT(seq_scaling_matrix_present_flag);
> +
> +		if (sps->seq_scaling_matrix_present_flag) {
> +			dev_err(dev,
> +				"%s: Handling scaling matrix not supported\n",
> +				__func__);
> +			return -EINVAL;
> +		}
> +	}
> +
> +	READ_UEV(log2_max_frame_num_minus4);
> +
> +	READ_UEV(pic_order_cnt_type);
> +	if (sps->pic_order_cnt_type == 0) {
> +		READ_UEV(log2_max_pic_order_cnt_lsb_minus4);
> +	} else if (sps->pic_order_cnt_type == 1) {
> +		READ_BIT(delta_pic_order_always_zero_flag);
> +		READ_SEV(offset_for_non_ref_pic);
> +		READ_SEV(offset_for_top_to_bottom_field);
> +
> +		READ_UEV(num_ref_frames_in_pic_order_cnt_cycle);
> +		for (i = 0; i < sps->num_ref_frames_in_pic_order_cnt_cycle; i++)
> +			READ_SEV(offset_for_ref_frame[i]);
> +	} else {
> +		dev_err(dev,
> +			"%s: Invalid pic_order_cnt_type %u\n", __func__,
> +			sps->pic_order_cnt_type);
> +		return -EINVAL;
> +	}
> +
> +	READ_UEV(max_num_ref_frames);
> +	READ_BIT(gaps_in_frame_num_value_allowed_flag);
> +	READ_UEV(pic_width_in_mbs_minus1);
> +	READ_UEV(pic_height_in_map_units_minus1);
> +
> +	READ_BIT(frame_mbs_only_flag);
> +	if (!sps->frame_mbs_only_flag)
> +		READ_BIT(mb_adaptive_frame_field_flag);
> +
> +	READ_BIT(direct_8x8_inference_flag);
> +
> +	READ_BIT(frame_cropping_flag);
> +	if (sps->frame_cropping_flag) {
> +		READ_UEV(crop_left);
> +		READ_UEV(crop_right);
> +		READ_UEV(crop_top);
> +		READ_UEV(crop_bottom);
> +	}
> +
> +	READ_BIT(vui_parameters_present_flag);
> +	if (sps->vui_parameters_present_flag) {
> +		err = nal_h264_read_vui_parameters(dev, rbsp, &sps->vui);
> +		if (err)
> +			return err;
> +	}
> +
> +	return 0;
> +}
> +
> +static int nal_h264_rbsp_write_pps(const struct device *dev,
> +				   struct rbsp *rbsp, struct nal_h264_pps *pps)
> +{
> +	struct nal_h264_pps *s = pps;
> +	unsigned int i;
> +
> +	WRITE_UEV(pic_parameter_set_id);
> +	WRITE_UEV(seq_parameter_set_id);
> +	WRITE_BIT(entropy_coding_mode_flag);
> +	WRITE_BIT(bottom_field_pic_order_in_frame_present_flag);
> +	WRITE_UEV(num_slice_groups_minus1);
> +	if (pps->num_slice_groups_minus1 > 0) {
> +		WRITE_UEV(slice_group_map_type);
> +		if (pps->slice_group_map_type == 0) {
> +			for (i = 0; i < pps->num_slice_groups_minus1; i++)
> +				WRITE_UEV(run_length_minus1[i]);
> +		} else if (pps->slice_group_map_type == 2) {
> +			for (i = 0; i < pps->num_slice_groups_minus1; i++) {
> +				WRITE_UEV(top_left[i]);
> +				WRITE_UEV(bottom_right[i]);
> +			}
> +		} else if (pps->slice_group_map_type == 3 ||
> +			   pps->slice_group_map_type == 4 ||
> +			   pps->slice_group_map_type == 5) {
> +			WRITE_BIT(slice_group_change_direction_flag);
> +			WRITE_UEV(slice_group_change_rate_minus1);
> +		} else if (pps->slice_group_map_type == 6) {
> +			WRITE_UEV(pic_size_in_map_units_minus1);
> +			for (i = 0; i < pps->pic_size_in_map_units_minus1; i++)
> +				WRITE_BITS(order_base_2
> +					   (s->num_slice_groups_minus1 + 1),
> +					   slice_group_id[i]);
> +		}
> +	}
> +	WRITE_UEV(num_ref_idx_l0_default_active_minus1);
> +	WRITE_UEV(num_ref_idx_l1_default_active_minus1);
> +	WRITE_BIT(weighted_pred_flag);
> +	WRITE_BITS(2, weighted_bipred_idc);
> +	WRITE_SEV(pic_init_qp_minus26);
> +	WRITE_SEV(pic_init_qs_minus26);
> +	WRITE_SEV(chroma_qp_index_offset);
> +	WRITE_BIT(deblocking_filter_control_present_flag);
> +	WRITE_BIT(constrained_intra_pred_flag);
> +	WRITE_BIT(redundant_pic_cnt_present_flag);
> +	if (/* more_rbsp_data() */ false) {
> +		WRITE_BIT(transform_8x8_mode_flag);
> +		WRITE_BIT(pic_scaling_matrix_present_flag);
> +		if (pps->pic_scaling_matrix_present_flag) {
> +			dev_err(dev,
> +				"%s: Handling scaling matrix not supported\n",
> +				__func__);
> +			return -EINVAL;
> +		}
> +		WRITE_SEV(second_chroma_qp_index_offset);
> +	}
> +
> +	return 0;
> +}
> +
> +static int nal_h264_rbsp_read_pps(const struct device *dev,
> +				  struct rbsp *rbsp, struct nal_h264_pps *pps)
> +{
> +	struct nal_h264_pps *s = pps;
> +	unsigned int i;
> +
> +	READ_UEV(pic_parameter_set_id);
> +	READ_UEV(seq_parameter_set_id);
> +	READ_BIT(entropy_coding_mode_flag);
> +	READ_BIT(bottom_field_pic_order_in_frame_present_flag);
> +	READ_UEV(num_slice_groups_minus1);
> +	if (s->num_slice_groups_minus1 > 0) {
> +		READ_UEV(slice_group_map_type);
> +		if (pps->slice_group_map_type == 0) {
> +			for (i = 0; i < pps->num_slice_groups_minus1; i++)
> +				READ_UEV(run_length_minus1[i]);
> +		} else if (pps->slice_group_map_type == 2) {
> +			for (i = 0; i < pps->num_slice_groups_minus1; i++) {
> +				READ_UEV(top_left[i]);
> +				READ_UEV(bottom_right[i]);
> +			}
> +		} else if (s->slice_group_map_type == 3 ||
> +			   s->slice_group_map_type == 4 ||
> +			   s->slice_group_map_type == 5) {
> +			READ_BIT(slice_group_change_direction_flag);
> +			READ_UEV(slice_group_change_rate_minus1);
> +		} else if (s->slice_group_map_type == 6) {
> +			READ_UEV(pic_size_in_map_units_minus1);
> +			for (i = 0; i < s->pic_size_in_map_units_minus1; i++)
> +				READ_BITS(order_base_2
> +					  (s->num_slice_groups_minus1 + 1),
> +					  slice_group_id[i]);
> +		}
> +	}
> +	READ_UEV(num_ref_idx_l0_default_active_minus1);
> +	READ_UEV(num_ref_idx_l1_default_active_minus1);
> +	READ_BIT(weighted_pred_flag);
> +	READ_BITS(2, weighted_bipred_idc);
> +	READ_SEV(pic_init_qp_minus26);
> +	READ_SEV(pic_init_qs_minus26);
> +	READ_SEV(chroma_qp_index_offset);
> +	READ_BIT(deblocking_filter_control_present_flag);
> +	READ_BIT(constrained_intra_pred_flag);
> +	READ_BIT(redundant_pic_cnt_present_flag);
> +	if (/* more_rbsp_data() */ false) {
> +		READ_BIT(transform_8x8_mode_flag);
> +		READ_BIT(pic_scaling_matrix_present_flag);
> +		if (pps->pic_scaling_matrix_present_flag) {
> +			dev_err(dev,
> +				"%s: Handling scaling matrix not supported\n",
> +				__func__);
> +			return -EINVAL;
> +		}
> +		READ_SEV(second_chroma_qp_index_offset);
> +	}
> +
> +	return 0;
> +}
> +
> +ssize_t nal_h264_write_sps(const struct device *dev,
> +			   void *dest, size_t n, struct nal_h264_sps *sps)
> +{
> +	struct rbsp rbsp;
> +	int err;
> +	u8 *p = dest;
> +
> +	rbsp.buf = p + 5;
> +	rbsp.size = n - 5;
> +	rbsp.pos = 0;
> +
> +	err = nal_h264_rbsp_write_sps(dev, &rbsp, sps);
> +	if (err)
> +		return err;
> +
> +	err = nal_h264_write_trailing_bits(dev, &rbsp);
> +	if (err)
> +		return err;
> +
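> +	/* Add the four byte start code and the SPS NAL unit header (nal_unit_type 7) */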
> +	p[0] = 0x00;
> +	p[1] = 0x00;
> +	p[2] = 0x00;
> +	p[3] = 0x01;
> +	p[4] = 0x07;
> +
> +	return ((rbsp.pos + 7) / 8) + 5;
> +}
> +EXPORT_SYMBOL_GPL(nal_h264_write_sps);
> +
> +ssize_t nal_h264_read_sps(const struct device *dev,
> +			  struct nal_h264_sps *sps, void *src, size_t n)
> +{
> +	struct rbsp rbsp;
> +	int err;
> +
> +	rbsp.buf = src;
> +	rbsp.size = n;
> +	rbsp.pos = 0;
> +
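> +	/* Skip the four byte start code and the NAL unit header */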
> +	rbsp.buf += 5;
> +	rbsp.size -= 5;
> +
> +	err = nal_h264_rbsp_read_sps(dev, &rbsp, sps);
> +	if (err)
> +		return err;
> +
> +	return ((rbsp.pos + 7) / 8) + 5;
> +}
> +EXPORT_SYMBOL_GPL(nal_h264_read_sps);
> +
> +void nal_h264_print_sps(const struct device *dev, struct nal_h264_sps *sps)
> +{
> +	struct nal_h264_sps *s = sps;
> +	unsigned int i;
> +
> +	if (!sps)
> +		return;
> +
> +	PRINT_BITS(8, profile_idc);
> +	PRINT_BIT(constraint_set0_flag);
> +	PRINT_BIT(constraint_set1_flag);
> +	PRINT_BIT(constraint_set2_flag);
> +	PRINT_BIT(constraint_set3_flag);
> +	PRINT_BIT(constraint_set4_flag);
> +	PRINT_BIT(constraint_set5_flag);
> +	PRINT_BITS(2, reserved_zero_2bits);
> +	PRINT_BITS(8, level_idc);
> +
> +	PRINT_UEV(seq_parameter_set_id);
> +
> +	if (sps->profile_idc == 100 || sps->profile_idc == 110 ||
> +	    sps->profile_idc == 122 || sps->profile_idc == 244 ||
> +	    sps->profile_idc == 44 || sps->profile_idc == 83 ||
> +	    sps->profile_idc == 86 || sps->profile_idc == 118 ||
> +	    sps->profile_idc == 128 || sps->profile_idc == 138 ||
> +	    sps->profile_idc == 139 || sps->profile_idc == 134 ||
> +	    sps->profile_idc == 135) {
> +		PRINT_UEV(chroma_format_idc);
> +
> +		if (sps->chroma_format_idc == 3)
> +			PRINT_BIT(separate_colour_plane_flag);
> +
> +		PRINT_UEV(bit_depth_luma_minus8);
> +		PRINT_UEV(bit_depth_chroma_minus8);
> +		PRINT_BIT(qpprime_y_zero_transform_bypass_flag);
> +		PRINT_BIT(seq_scaling_matrix_present_flag);
> +
> +		if (sps->seq_scaling_matrix_present_flag)
> +			dev_err(dev,
> +				"%s: Handling scaling matrix not supported\n",
> +				__func__);
> +	}
> +
> +	PRINT_UEV(log2_max_frame_num_minus4);
> +
> +	PRINT_UEV(pic_order_cnt_type);
> +	if (sps->pic_order_cnt_type == 0) {
> +		PRINT_UEV(log2_max_pic_order_cnt_lsb_minus4);
> +	} else if (sps->pic_order_cnt_type == 1) {
> +		PRINT_BIT(delta_pic_order_always_zero_flag);
> +		PRINT_SEV(offset_for_non_ref_pic);
> +		PRINT_SEV(offset_for_top_to_bottom_field);
> +
> +		PRINT_UEV(num_ref_frames_in_pic_order_cnt_cycle);
> +		for (i = 0; i < sps->num_ref_frames_in_pic_order_cnt_cycle; i++)
> +			PRINT_SEV(offset_for_ref_frame[i]);
> +	} else {
> +		dev_err(dev,
> +			"%s: Invalid pic_order_cnt_type %u\n", __func__,
> +			sps->pic_order_cnt_type);
> +	}
> +
> +	PRINT_UEV(max_num_ref_frames);
> +	PRINT_BIT(gaps_in_frame_num_value_allowed_flag);
> +	PRINT_UEV(pic_width_in_mbs_minus1);
> +	PRINT_UEV(pic_height_in_map_units_minus1);
> +
> +	PRINT_BIT(frame_mbs_only_flag);
> +	if (!sps->frame_mbs_only_flag)
> +		PRINT_BIT(mb_adaptive_frame_field_flag);
> +
> +	PRINT_BIT(direct_8x8_inference_flag);
> +
> +	PRINT_BIT(frame_cropping_flag);
> +	if (sps->frame_cropping_flag) {
> +		PRINT_UEV(crop_left);
> +		PRINT_UEV(crop_right);
> +		PRINT_UEV(crop_top);
> +		PRINT_UEV(crop_bottom);
> +	}
> +
> +	PRINT_BIT(vui_parameters_present_flag);
> +	if (sps->vui_parameters_present_flag)
> +		nal_h264_print_vui_parameters(dev, &sps->vui);
> +}
> +EXPORT_SYMBOL_GPL(nal_h264_print_sps);
> +
> +ssize_t nal_h264_write_pps(const struct device *dev,
> +			   void *dest, size_t n, struct nal_h264_pps *pps)
> +{
> +	struct rbsp rbsp;
> +	int err;
> +	u8 *p = dest;
> +
> +	rbsp.buf = p + 5;
> +	rbsp.size = n - 5;
> +	rbsp.pos = 0;
> +
> +	err = nal_h264_rbsp_write_pps(dev, &rbsp, pps);
> +	if (err)
> +		return err;
> +
> +	err = nal_h264_write_trailing_bits(dev, &rbsp);
> +	if (err)
> +		return err;
> +
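> +	/* Add the four byte start code and the PPS NAL unit header (nal_unit_type 8) */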
> +	p[0] = 0x00;
> +	p[1] = 0x00;
> +	p[2] = 0x00;
> +	p[3] = 0x01;
> +	p[4] = 0x08;
> +
> +	return ((rbsp.pos + 7) / 8) + 5;
> +}
> +EXPORT_SYMBOL_GPL(nal_h264_write_pps);
> +
> +ssize_t nal_h264_read_pps(const struct device *dev,
> +			  struct nal_h264_pps *pps, void *src, size_t n)
> +{
> +	struct rbsp rbsp;
> +	int err;
> +
> +	rbsp.buf = src;
> +	rbsp.size = n;
> +	rbsp.pos = 0;
> +
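> +	/* Skip the four byte start code and the NAL unit header */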
> +	rbsp.buf += 5;
> +	rbsp.size -= 5;
> +
> +	err = nal_h264_rbsp_read_pps(dev, &rbsp, pps);
> +	if (err)
> +		return err;
> +
> +	return ((rbsp.pos + 7) / 8) + 5;
> +}
> +EXPORT_SYMBOL_GPL(nal_h264_read_pps);
> +
> +void nal_h264_print_pps(const struct device *dev, struct nal_h264_pps *pps)
> +{
> +	struct nal_h264_pps *s = pps;
> +	unsigned int i;
> +
> +	if (!pps)
> +		return;
> +
> +	PRINT_UEV(pic_parameter_set_id);
> +	PRINT_UEV(seq_parameter_set_id);
> +	PRINT_BIT(entropy_coding_mode_flag);
> +	PRINT_BIT(bottom_field_pic_order_in_frame_present_flag);
> +	PRINT_UEV(num_slice_groups_minus1);
> +	if (s->num_slice_groups_minus1 > 0) {
> +		PRINT_UEV(slice_group_map_type);
> +		if (pps->slice_group_map_type == 0) {
> +			for (i = 0; i < pps->num_slice_groups_minus1; i++)
> +				PRINT_UEV(run_length_minus1[i]);
> +		} else if (pps->slice_group_map_type == 2) {
> +			for (i = 0; i < pps->num_slice_groups_minus1; i++) {
> +				PRINT_UEV(top_left[i]);
> +				PRINT_UEV(bottom_right[i]);
> +			}
> +		} else if (s->slice_group_map_type == 3 ||
> +			   s->slice_group_map_type == 4 ||
> +			   s->slice_group_map_type == 5) {
> +			PRINT_BIT(slice_group_change_direction_flag);
> +			PRINT_UEV(slice_group_change_rate_minus1);
> +		} else if (s->slice_group_map_type == 6) {
> +			PRINT_UEV(pic_size_in_map_units_minus1);
> +			for (i = 0; i < s->pic_size_in_map_units_minus1; i++)
> +				PRINT_BITS(order_base_2
> +					   (s->num_slice_groups_minus1 + 1),
> +					   slice_group_id[i]);
> +		}
> +	}
> +	PRINT_UEV(num_ref_idx_l0_default_active_minus1);
> +	PRINT_UEV(num_ref_idx_l1_default_active_minus1);
> +	PRINT_BIT(weighted_pred_flag);
> +	PRINT_BITS(2, weighted_bipred_idc);
> +	PRINT_SEV(pic_init_qp_minus26);
> +	PRINT_SEV(pic_init_qs_minus26);
> +	PRINT_SEV(chroma_qp_index_offset);
> +	PRINT_BIT(deblocking_filter_control_present_flag);
> +	PRINT_BIT(constrained_intra_pred_flag);
> +	PRINT_BIT(redundant_pic_cnt_present_flag);
> +	if (/* more_rbsp_data() */ false) {
> +		PRINT_BIT(transform_8x8_mode_flag);
> +		PRINT_BIT(pic_scaling_matrix_present_flag);
> +		if (pps->pic_scaling_matrix_present_flag) {
> +			dev_err(dev,
> +				"%s: Handling scaling matrix not supported\n",
> +				__func__);
> +		}
> +		PRINT_SEV(second_chroma_qp_index_offset);
> +	}
> +}
> +EXPORT_SYMBOL_GPL(nal_h264_print_pps);
> +
> +ssize_t nal_h264_read_filler(const struct device *dev, void *src, size_t n)
> +{
> +	u8 *p = src;
> +	size_t i = 5;
> +
> +	if (n < 6)
> +		return -EINVAL;
> +
> +	if (p[0] != 0x00 || p[1] != 0x00 || p[2] != 0x00 || p[3] != 0x01)
> +		return -EINVAL;
> +
> +	if (p[4] != 0x0c)
> +		return -EINVAL;
> +
> +	while (i < n && p[i] == 0xff)
> +		i++;
> +
> +	if (i == n || p[i] != 0x80)
> +		return -EINVAL;
> +
> +	return i;
> +}
> +EXPORT_SYMBOL_GPL(nal_h264_read_filler);
> +
> +ssize_t nal_h264_write_filler(const struct device *dev, void *dest, size_t n)
> +{
> +	u8 *p = dest;
> +
> +	if (n < 6)
> +		return -EINVAL;
> +
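> +	/* Start code, filler data NAL unit header (nal_unit_type 12), 0xff filler bytes, and the rbsp trailing bits */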
> +	p[0] = 0x00;
> +	p[1] = 0x00;
> +	p[2] = 0x00;
> +	p[3] = 0x01;
> +	p[4] = 0x0c;
> +	memset(p + 5, 0xff, n - 6);
> +	p[n - 1] = 0x80;
> +
> +	return n;
> +}
> +EXPORT_SYMBOL_GPL(nal_h264_write_filler);
> diff --git a/drivers/staging/media/allegro-dvt/nal-h264.h b/drivers/staging/media/allegro-dvt/nal-h264.h
> new file mode 100644
> index 000000000000..bd46e8e42caf
> --- /dev/null
> +++ b/drivers/staging/media/allegro-dvt/nal-h264.h
> @@ -0,0 +1,188 @@
> +/* SPDX-License-Identifier: GPL-2.0 */
> +/*
> + * Copyright (C) 2019 Pengutronix, Michael Tretter <kernel@pengutronix.de>
> + *
> + * Convert NAL units between raw byte sequence payloads (RBSP) and C structs.
> + */
> +
> +#ifndef __NAL_H264_H__
> +#define __NAL_H264_H__
> +
> +#include <linux/kernel.h>
> +#include <linux/types.h>
> +#include <media/v4l2-ctrls.h>
> +
> +/* Rec. ITU-T H.264 (04/2017) E.1.2 */
> +struct nal_h264_hrd_parameters {
> +	unsigned int cpb_cnt_minus1;
> +	unsigned int bit_rate_scale:4;
> +	unsigned int cpb_size_scale:4;
> +	struct {
> +		int bit_rate_value_minus1[16];
> +		int cpb_size_value_minus1[16];
> +		unsigned int cbr_flag[16]; /* FIXME cbr_flag:1 */
> +	};
> +	unsigned int initial_cpb_removal_delay_length_minus1:5;
> +	unsigned int cpb_removal_delay_length_minus1:5;
> +	unsigned int dpb_output_delay_length_minus1:5;
> +	unsigned int time_offset_length:5;
> +};
> +
> +/* Rec. ITU-T H.264 (04/2017) E.1.1 */
> +struct nal_h264_vui_parameters {
> +	unsigned int aspect_ratio_info_present_flag:1;
> +	struct {
> +		unsigned int aspect_ratio_idc:8;
> +		unsigned int sar_width:16;
> +		unsigned int sar_height:16;
> +	};
> +	unsigned int overscan_info_present_flag:1;
> +	unsigned int overscan_appropriate_flag:1;
> +	unsigned int video_signal_type_present_flag:1;
> +	struct {
> +		unsigned int video_format:3;
> +		unsigned int video_full_range_flag:1;
> +		unsigned int colour_description_present_flag:1;
> +		struct {
> +			unsigned int colour_primaries:8;
> +			unsigned int transfer_characteristics:8;
> +			unsigned int matrix_coefficients:8;
> +		};
> +	};
> +	unsigned int chroma_loc_info_present_flag:1;
> +	struct {
> +		unsigned int chroma_sample_loc_type_top_field;
> +		unsigned int chroma_sample_loc_type_bottom_field;
> +	};
> +	unsigned int timing_info_present_flag:1;
> +	struct {
> +		unsigned int num_units_in_tick:32;
> +		unsigned int time_scale:32;
> +		unsigned int fixed_frame_rate_flag:1;
> +	};
> +	unsigned int nal_hrd_parameters_present_flag:1;
> +	struct nal_h264_hrd_parameters nal_hrd_parameters;
> +	unsigned int vcl_hrd_parameters_present_flag:1;
> +	struct nal_h264_hrd_parameters vcl_hrd_parameters;
> +	unsigned int low_delay_hrd_flag:1;
> +	unsigned int pic_struct_present_flag:1;
> +	unsigned int bitstream_restriction_flag:1;
> +	struct {
> +		unsigned int motion_vectors_over_pic_boundaries_flag:1;
> +		unsigned int max_bytes_per_pic_denom;
> +		unsigned int max_bits_per_mb_denom;
> +		unsigned int log2_max_mv_length_horizontal;
> +		unsigned int log2_max_mv_length_vertical;
> +		unsigned int max_num_reorder_frames;
> +		unsigned int max_dec_frame_buffering;
> +	};
> +};
> +
> +/* Rec. ITU-T H.264 (04/2017) 7.3.2.1.1 Sequence parameter set data syntax */
> +struct nal_h264_sps {
> +	unsigned int profile_idc:8;
> +	unsigned int constraint_set0_flag:1;
> +	unsigned int constraint_set1_flag:1;
> +	unsigned int constraint_set2_flag:1;
> +	unsigned int constraint_set3_flag:1;
> +	unsigned int constraint_set4_flag:1;
> +	unsigned int constraint_set5_flag:1;
> +	unsigned int reserved_zero_2bits:2;
> +	unsigned int level_idc:8;
> +	unsigned int seq_parameter_set_id;
> +	struct {
> +		unsigned int chroma_format_idc;
> +		unsigned int separate_colour_plane_flag:1;
> +		unsigned int bit_depth_luma_minus8;
> +		unsigned int bit_depth_chroma_minus8;
> +		unsigned int qpprime_y_zero_transform_bypass_flag:1;
> +		unsigned int seq_scaling_matrix_present_flag:1;
> +	};
> +	unsigned int log2_max_frame_num_minus4;
> +	unsigned int pic_order_cnt_type;
> +	union {
> +		unsigned int log2_max_pic_order_cnt_lsb_minus4;
> +		struct {
> +			unsigned int delta_pic_order_always_zero_flag:1;
> +			int offset_for_non_ref_pic;
> +			int offset_for_top_to_bottom_field;
> +			unsigned int num_ref_frames_in_pic_order_cnt_cycle;
> +			int offset_for_ref_frame[255];
> +		};
> +	};
> +	unsigned int max_num_ref_frames;
> +	unsigned int gaps_in_frame_num_value_allowed_flag:1;
> +	unsigned int pic_width_in_mbs_minus1;
> +	unsigned int pic_height_in_map_units_minus1;
> +	unsigned int frame_mbs_only_flag:1;
> +	unsigned int mb_adaptive_frame_field_flag:1;
> +	unsigned int direct_8x8_inference_flag:1;
> +	unsigned int frame_cropping_flag:1;
> +	struct {
> +		unsigned int crop_left;
> +		unsigned int crop_right;
> +		unsigned int crop_top;
> +		unsigned int crop_bottom;
> +	};
> +	unsigned int vui_parameters_present_flag:1;
> +	struct nal_h264_vui_parameters vui;
> +};
> +
> +/* Rec. ITU-T H.264 (04/2017) 7.3.2.2 Picture parameter set RBSP syntax */
> +struct nal_h264_pps {
> +	unsigned int pic_parameter_set_id;
> +	unsigned int seq_parameter_set_id;
> +	unsigned int entropy_coding_mode_flag:1;
> +	unsigned int bottom_field_pic_order_in_frame_present_flag:1;
> +	unsigned int num_slice_groups_minus1;
> +	unsigned int slice_group_map_type;
> +	union {
> +		unsigned int run_length_minus1[8];
> +		struct {
> +			unsigned int top_left[8];
> +			unsigned int bottom_right[8];
> +		};
> +		struct {
> +			unsigned int slice_group_change_direction_flag:1;
> +			unsigned int slice_group_change_rate_minus1;
> +		};
> +		struct {
> +			unsigned int pic_size_in_map_units_minus1;
> +			unsigned int slice_group_id[8];
> +		};
> +	};
> +	unsigned int num_ref_idx_l0_default_active_minus1;
> +	unsigned int num_ref_idx_l1_default_active_minus1;
> +	unsigned int weighted_pred_flag:1;
> +	unsigned int weighted_bipred_idc:2;
> +	int pic_init_qp_minus26;
> +	int pic_init_qs_minus26;
> +	int chroma_qp_index_offset;
> +	unsigned int deblocking_filter_control_present_flag:1;
> +	unsigned int constrained_intra_pred_flag:1;
> +	unsigned int redundant_pic_cnt_present_flag:1;
> +	struct {
> +		unsigned int transform_8x8_mode_flag:1;
> +		unsigned int pic_scaling_matrix_present_flag:1;
> +		int second_chroma_qp_index_offset;
> +	};
> +};
> +
> +int nal_h264_level_from_v4l2(enum v4l2_mpeg_video_h264_level level);
> +int nal_h264_profile_from_v4l2(enum v4l2_mpeg_video_h264_profile profile);
> +
> +ssize_t nal_h264_write_sps(const struct device *dev,
> +			   void *dest, size_t n, struct nal_h264_sps *sps);
> +ssize_t nal_h264_read_sps(const struct device *dev,
> +			  struct nal_h264_sps *sps, void *src, size_t n);
> +void nal_h264_print_sps(const struct device *dev, struct nal_h264_sps *sps);
> +
> +ssize_t nal_h264_write_pps(const struct device *dev,
> +			   void *dest, size_t n, struct nal_h264_pps *pps);
> +ssize_t nal_h264_read_pps(const struct device *dev,
> +			  struct nal_h264_pps *pps, void *src, size_t n);
> +void nal_h264_print_pps(const struct device *dev, struct nal_h264_pps *pps);
> +
> +ssize_t nal_h264_write_filler(const struct device *dev, void *dest, size_t n);
> +ssize_t nal_h264_read_filler(const struct device *dev, void *src, size_t n);

Can you document these functions? I think there is a good chance that these
functions can be reused in other drivers in the future, so having some
documentation here would be very useful.
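
For example, something along these lines would already help a lot (just a
sketch, the exact wording is of course up to you):

/**
 * nal_h264_write_sps() - Write SPS NAL unit into RBSP format
 * @dev: device pointer used for logging
 * @dest: the buffer that is filled with RBSP data
 * @n: maximum size of @dest in bytes
 * @sps: &struct nal_h264_sps to convert into RBSP
 *
 * Convert @sps into RBSP data and write it into @dest, prefixed with the
 * four byte start code and the SPS NAL unit header.
 *
 * Return: number of bytes written to @dest or a negative error code
 */
ssize_t nal_h264_write_sps(const struct device *dev,
			   void *dest, size_t n, struct nal_h264_sps *sps);

The read/print helpers and the PPS and filler functions could get the same
treatment.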

Regards,

	Hans

> +
> +#endif /* __NAL_H264_H__ */
>