1.. SPDX-License-Identifier: GPL-2.0 2 3.. include:: <isonum.txt> 4 5=============================================================== 6Intel Image Processing Unit 3 (IPU3) Imaging Unit (ImgU) driver 7=============================================================== 8 9Copyright |copy| 2018 Intel Corporation 10 11Introduction 12============ 13 14This file documents the Intel IPU3 (3rd generation Image Processing Unit) 15Imaging Unit drivers located under drivers/media/pci/intel/ipu3 (CIO2) as well 16as under drivers/staging/media/ipu3 (ImgU). 17 18The Intel IPU3 found in certain Kaby Lake (as well as certain Sky Lake) 19platforms (U/Y processor lines) is made up of two parts namely the Imaging Unit 20(ImgU) and the CIO2 device (MIPI CSI2 receiver). 21 22The CIO2 device receives the raw Bayer data from the sensors and outputs the 23frames in a format that is specific to the IPU3 (for consumption by the IPU3 24ImgU). The CIO2 driver is available as drivers/media/pci/intel/ipu3/ipu3-cio2* 25and is enabled through the CONFIG_VIDEO_IPU3_CIO2 config option. 26 27The Imaging Unit (ImgU) is responsible for processing images captured 28by the IPU3 CIO2 device. The ImgU driver sources can be found under 29drivers/staging/media/ipu3 directory. The driver is enabled through the 30CONFIG_VIDEO_IPU3_IMGU config option. 31 32The two driver modules are named ipu3_csi2 and ipu3_imgu, respectively. 33 34The drivers has been tested on Kaby Lake platforms (U/Y processor lines). 35 36Both of the drivers implement V4L2, Media Controller and V4L2 sub-device 37interfaces. The IPU3 CIO2 driver supports camera sensors connected to the CIO2 38MIPI CSI-2 interfaces through V4L2 sub-device sensor drivers. 39 40CIO2 41==== 42 43The CIO2 is represented as a single V4L2 subdev, which provides a V4L2 subdev 44interface to the user space. There is a video node for each CSI-2 receiver, 45with a single media controller interface for the entire device. 46 47The CIO2 contains four independent capture channel, each with its own MIPI CSI-2 48receiver and DMA engine. Each channel is modelled as a V4L2 sub-device exposed 49to userspace as a V4L2 sub-device node and has two pads: 50 51.. tabularcolumns:: |p{0.8cm}|p{4.0cm}|p{4.0cm}| 52 53.. flat-table:: 54 55 * - pad 56 - direction 57 - purpose 58 59 * - 0 60 - sink 61 - MIPI CSI-2 input, connected to the sensor subdev 62 63 * - 1 64 - source 65 - Raw video capture, connected to the V4L2 video interface 66 67The V4L2 video interfaces model the DMA engines. They are exposed to userspace 68as V4L2 video device nodes. 69 70Capturing frames in raw Bayer format 71------------------------------------ 72 73CIO2 MIPI CSI2 receiver is used to capture frames (in packed raw Bayer format) 74from the raw sensors connected to the CSI2 ports. The captured frames are used 75as input to the ImgU driver. 76 77Image processing using IPU3 ImgU requires tools such as raw2pnm [#f1]_, and 78yavta [#f2]_ due to the following unique requirements and / or features specific 79to IPU3. 80 81-- The IPU3 CSI2 receiver outputs the captured frames from the sensor in packed 82raw Bayer format that is specific to IPU3. 83 84-- Multiple video nodes have to be operated simultaneously. 85 86Let us take the example of ov5670 sensor connected to CSI2 port 0, for a 872592x1944 image capture. 88 89Using the media controller APIs, the ov5670 sensor is configured to send 90frames in packed raw Bayer format to IPU3 CSI2 receiver. 91 92.. code-block:: none 93 94 # This example assumes /dev/media0 as the CIO2 media device 95 export MDEV=/dev/media0 96 97 # and that ov5670 sensor is connected to i2c bus 10 with address 0x36 98 export SDEV=$(media-ctl -d $MDEV -e "ov5670 10-0036") 99 100 # Establish the link for the media devices using media-ctl [#f3]_ 101 media-ctl -d $MDEV -l "ov5670:0 -> ipu3-csi2 0:0[1]" 102 103 # Set the format for the media devices 104 media-ctl -d $MDEV -V "ov5670:0 [fmt:SGRBG10/2592x1944]" 105 media-ctl -d $MDEV -V "ipu3-csi2 0:0 [fmt:SGRBG10/2592x1944]" 106 media-ctl -d $MDEV -V "ipu3-csi2 0:1 [fmt:SGRBG10/2592x1944]" 107 108Once the media pipeline is configured, desired sensor specific settings 109(such as exposure and gain settings) can be set, using the yavta tool. 110 111e.g 112 113.. code-block:: none 114 115 yavta -w 0x009e0903 444 $SDEV 116 yavta -w 0x009e0913 1024 $SDEV 117 yavta -w 0x009e0911 2046 $SDEV 118 119Once the desired sensor settings are set, frame captures can be done as below. 120 121e.g 122 123.. code-block:: none 124 125 yavta --data-prefix -u -c10 -n5 -I -s2592x1944 --file=/tmp/frame-#.bin \ 126 -f IPU3_SGRBG10 $(media-ctl -d $MDEV -e "ipu3-cio2 0") 127 128With the above command, 10 frames are captured at 2592x1944 resolution, with 129sGRBG10 format and output as IPU3_SGRBG10 format. 130 131The captured frames are available as /tmp/frame-#.bin files. 132 133ImgU 134==== 135 136The ImgU is represented as two V4L2 subdevs, each of which provides a V4L2 137subdev interface to the user space. 138 139Each V4L2 subdev represents a pipe, which can support a maximum of 2 streams. 140This helps to support advanced camera features like Continuous View Finder (CVF) 141and Snapshot During Video(SDV). 142 143The ImgU contains two independent pipes, each modelled as a V4L2 sub-device 144exposed to userspace as a V4L2 sub-device node. 145 146Each pipe has two sink pads and three source pads for the following purpose: 147 148.. tabularcolumns:: |p{0.8cm}|p{4.0cm}|p{4.0cm}| 149 150.. flat-table:: 151 152 * - pad 153 - direction 154 - purpose 155 156 * - 0 157 - sink 158 - Input raw video stream 159 160 * - 1 161 - sink 162 - Processing parameters 163 164 * - 2 165 - source 166 - Output processed video stream 167 168 * - 3 169 - source 170 - Output viewfinder video stream 171 172 * - 4 173 - source 174 - 3A statistics 175 176Each pad is connected to a corresponding V4L2 video interface, exposed to 177userspace as a V4L2 video device node. 178 179Device operation 180---------------- 181 182With ImgU, once the input video node ("ipu3-imgu 0/1":0, in 183<entity>:<pad-number> format) is queued with buffer (in packed raw Bayer 184format), ImgU starts processing the buffer and produces the video output in YUV 185format and statistics output on respective output nodes. The driver is expected 186to have buffers ready for all of parameter, output and statistics nodes, when 187input video node is queued with buffer. 188 189At a minimum, all of input, main output, 3A statistics and viewfinder 190video nodes should be enabled for IPU3 to start image processing. 191 192Each ImgU V4L2 subdev has the following set of video nodes. 193 194input, output and viewfinder video nodes 195---------------------------------------- 196 197The frames (in packed raw Bayer format specific to the IPU3) received by the 198input video node is processed by the IPU3 Imaging Unit and are output to 2 video 199nodes, with each targeting a different purpose (main output and viewfinder 200output). 201 202Details onand the Bayer format specific to the IPU3 can be found in 203:ref:`v4l2-pix-fmt-ipu3-sbggr10`. 204 205The driver supports V4L2 Video Capture Interface as defined at :ref:`devices`. 206 207Only the multi-planar API is supported. More details can be found at 208:ref:`planar-apis`. 209 210Parameters video node 211--------------------- 212 213The parameters video node receives the ImgU algorithm parameters that are used 214to configure how the ImgU algorithms process the image. 215 216Details on processing parameters specific to the IPU3 can be found in 217:ref:`v4l2-meta-fmt-params`. 218 2193A statistics video node 220------------------------ 221 2223A statistics video node is used by the ImgU driver to output the 3A (auto 223focus, auto exposure and auto white balance) statistics for the frames that are 224being processed by the ImgU to user space applications. User space applications 225can use this statistics data to compute the desired algorithm parameters for 226the ImgU. 227 228Configuring the Intel IPU3 229========================== 230 231The IPU3 ImgU pipelines can be configured using the Media Controller, defined at 232:ref:`media_controller`. 233 234Running mode and firmware binary selection 235------------------------------------------ 236 237ImgU works based on firmware, currently the ImgU firmware support run 2 pipes 238in time-sharing with single input frame data. Each pipe can run at certain mode 239- "VIDEO" or "STILL", "VIDEO" mode is commonly used for video frames capture, 240and "STILL" is used for still frame capture. However, you can also select 241"VIDEO" to capture still frames if you want to capture images with less system 242load and power. For "STILL" mode, ImgU will try to use smaller BDS factor and 243output larger bayer frame for further YUV processing than "VIDEO" mode to get 244high quality images. Besides, "STILL" mode need XNR3 to do noise reduction, 245hence "STILL" mode will need more power and memory bandwidth than "VIDEO" mode. 246TNR will be enabled in "VIDEO" mode and bypassed by "STILL" mode. ImgU is 247running at "VIDEO" mode by default, the user can use v4l2 control 248V4L2_CID_INTEL_IPU3_MODE (currently defined in 249drivers/staging/media/ipu3/include/uapi/intel-ipu3.h) to query and set the 250running mode. For user, there is no difference for buffer queueing between the 251"VIDEO" and "STILL" mode, mandatory input and main output node should be 252enabled and buffers need be queued, the statistics and the view-finder queues 253are optional. 254 255The firmware binary will be selected according to current running mode, such log 256"using binary if_to_osys_striped " or "using binary if_to_osys_primary_striped" 257could be observed if you enable the ImgU dynamic debug, the binary 258if_to_osys_striped is selected for "VIDEO" and the binary 259"if_to_osys_primary_striped" is selected for "STILL". 260 261 262Processing the image in raw Bayer format 263---------------------------------------- 264 265Configuring ImgU V4L2 subdev for image processing 266~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 267 268The ImgU V4L2 subdevs have to be configured with media controller APIs to have 269all the video nodes setup correctly. 270 271Let us take "ipu3-imgu 0" subdev as an example. 272 273.. code-block:: none 274 275 media-ctl -d $MDEV -r 276 media-ctl -d $MDEV -l "ipu3-imgu 0 input":0 -> "ipu3-imgu 0":0[1] 277 media-ctl -d $MDEV -l "ipu3-imgu 0":2 -> "ipu3-imgu 0 output":0[1] 278 media-ctl -d $MDEV -l "ipu3-imgu 0":3 -> "ipu3-imgu 0 viewfinder":0[1] 279 media-ctl -d $MDEV -l "ipu3-imgu 0":4 -> "ipu3-imgu 0 3a stat":0[1] 280 281Also the pipe mode of the corresponding V4L2 subdev should be set as desired 282(e.g 0 for video mode or 1 for still mode) through the control id 0x009819a1 as 283below. 284 285.. code-block:: none 286 287 yavta -w "0x009819A1 1" /dev/v4l-subdev7 288 289Certain hardware blocks in ImgU pipeline can change the frame resolution by 290cropping or scaling, these hardware blocks include Input Feeder(IF), Bayer Down 291Scaler (BDS) and Geometric Distortion Correction (GDC). 292There is also a block which can change the frame resolution - YUV Scaler, it is 293only applicable to the secondary output. 294 295RAW Bayer frames go through these ImgU pipeline hardware blocks and the final 296processed image output to the DDR memory. 297 298.. kernel-figure:: ipu3_rcb.svg 299 :alt: ipu3 resolution blocks image 300 301 IPU3 resolution change hardware blocks 302 303**Input Feeder** 304 305Input Feeder gets the Bayer frame data from the sensor, it can enable cropping 306of lines and columns from the frame and then store pixels into device's internal 307pixel buffer which are ready to readout by following blocks. 308 309**Bayer Down Scaler** 310 311Bayer Down Scaler is capable of performing image scaling in Bayer domain, the 312downscale factor can be configured from 1X to 1/4X in each axis with 313configuration steps of 0.03125 (1/32). 314 315**Geometric Distortion Correction** 316 317Geometric Distortion Correction is used to perform correction of distortions 318and image filtering. It needs some extra filter and envelope padding pixels to 319work, so the input resolution of GDC should be larger than the output 320resolution. 321 322**YUV Scaler** 323 324YUV Scaler which similar with BDS, but it is mainly do image down scaling in 325YUV domain, it can support up to 1/12X down scaling, but it can not be applied 326to the main output. 327 328The ImgU V4L2 subdev has to be configured with the supported resolutions in all 329the above hardware blocks, for a given input resolution. 330For a given supported resolution for an input frame, the Input Feeder, Bayer 331Down Scaler and GDC blocks should be configured with the supported resolutions 332as each hardware block has its own alignment requirement. 333 334You must configure the output resolution of the hardware blocks smartly to meet 335the hardware requirement along with keeping the maximum field of view. The 336intermediate resolutions can be generated by specific tool - 337 338https://github.com/intel/intel-ipu3-pipecfg 339 340This tool can be used to generate intermediate resolutions. More information can 341be obtained by looking at the following IPU3 ImgU configuration table. 342 343https://chromium.googlesource.com/chromiumos/overlays/board-overlays/+/master 344 345Under baseboard-poppy/media-libs/cros-camera-hal-configs-poppy/files/gcss 346directory, graph_settings_ov5670.xml can be used as an example. 347 348The following steps prepare the ImgU pipeline for the image processing. 349 3501. The ImgU V4L2 subdev data format should be set by using the 351VIDIOC_SUBDEV_S_FMT on pad 0, using the GDC width and height obtained above. 352 3532. The ImgU V4L2 subdev cropping should be set by using the 354VIDIOC_SUBDEV_S_SELECTION on pad 0, with V4L2_SEL_TGT_CROP as the target, 355using the input feeder height and width. 356 3573. The ImgU V4L2 subdev composing should be set by using the 358VIDIOC_SUBDEV_S_SELECTION on pad 0, with V4L2_SEL_TGT_COMPOSE as the target, 359using the BDS height and width. 360 361For the ov5670 example, for an input frame with a resolution of 2592x1944 362(which is input to the ImgU subdev pad 0), the corresponding resolutions 363for input feeder, BDS and GDC are 2592x1944, 2592x1944 and 2560x1920 364respectively. 365 366Once this is done, the received raw Bayer frames can be input to the ImgU 367V4L2 subdev as below, using the open source application v4l2n [#f1]_. 368 369For an image captured with 2592x1944 [#f4]_ resolution, with desired output 370resolution as 2560x1920 and viewfinder resolution as 2560x1920, the following 371v4l2n command can be used. This helps process the raw Bayer frames and produces 372the desired results for the main output image and the viewfinder output, in NV12 373format. 374 375.. code-block:: none 376 377 v4l2n --pipe=4 --load=/tmp/frame-#.bin --open=/dev/video4 378 --fmt=type:VIDEO_OUTPUT_MPLANE,width=2592,height=1944,pixelformat=0X47337069 \ 379 --reqbufs=type:VIDEO_OUTPUT_MPLANE,count:1 --pipe=1 \ 380 --output=/tmp/frames.out --open=/dev/video5 \ 381 --fmt=type:VIDEO_CAPTURE_MPLANE,width=2560,height=1920,pixelformat=NV12 \ 382 --reqbufs=type:VIDEO_CAPTURE_MPLANE,count:1 --pipe=2 \ 383 --output=/tmp/frames.vf --open=/dev/video6 \ 384 --fmt=type:VIDEO_CAPTURE_MPLANE,width=2560,height=1920,pixelformat=NV12 \ 385 --reqbufs=type:VIDEO_CAPTURE_MPLANE,count:1 --pipe=3 --open=/dev/video7 \ 386 --output=/tmp/frames.3A --fmt=type:META_CAPTURE,? \ 387 --reqbufs=count:1,type:META_CAPTURE --pipe=1,2,3,4 --stream=5 388 389You can also use yavta [#f2]_ command to do same thing as above: 390 391.. code-block:: none 392 393 yavta --data-prefix -Bcapture-mplane -c10 -n5 -I -s2592x1944 \ 394 --file=frame-#.out-f NV12 /dev/video5 & \ 395 yavta --data-prefix -Bcapture-mplane -c10 -n5 -I -s2592x1944 \ 396 --file=frame-#.vf -f NV12 /dev/video6 & \ 397 yavta --data-prefix -Bmeta-capture -c10 -n5 -I \ 398 --file=frame-#.3a /dev/video7 & \ 399 yavta --data-prefix -Boutput-mplane -c10 -n5 -I -s2592x1944 \ 400 --file=/tmp/frame-in.cio2 -f IPU3_SGRBG10 /dev/video4 401 402where /dev/video4, /dev/video5, /dev/video6 and /dev/video7 devices point to 403input, output, viewfinder and 3A statistics video nodes respectively. 404 405Converting the raw Bayer image into YUV domain 406---------------------------------------------- 407 408The processed images after the above step, can be converted to YUV domain 409as below. 410 411Main output frames 412~~~~~~~~~~~~~~~~~~ 413 414.. code-block:: none 415 416 raw2pnm -x2560 -y1920 -fNV12 /tmp/frames.out /tmp/frames.out.ppm 417 418where 2560x1920 is output resolution, NV12 is the video format, followed 419by input frame and output PNM file. 420 421Viewfinder output frames 422~~~~~~~~~~~~~~~~~~~~~~~~ 423 424.. code-block:: none 425 426 raw2pnm -x2560 -y1920 -fNV12 /tmp/frames.vf /tmp/frames.vf.ppm 427 428where 2560x1920 is output resolution, NV12 is the video format, followed 429by input frame and output PNM file. 430 431Example user space code for IPU3 432================================ 433 434User space code that configures and uses IPU3 is available here. 435 436https://chromium.googlesource.com/chromiumos/platform/arc-camera/+/master/ 437 438The source can be located under hal/intel directory. 439 440Overview of IPU3 pipeline 441========================= 442 443IPU3 pipeline has a number of image processing stages, each of which takes a 444set of parameters as input. The major stages of pipelines are shown here: 445 446.. kernel-render:: DOT 447 :alt: IPU3 ImgU Pipeline 448 :caption: IPU3 ImgU Pipeline Diagram 449 450 digraph "IPU3 ImgU" { 451 node [shape=box] 452 splines="ortho" 453 rankdir="LR" 454 455 a [label="Raw pixels"] 456 b [label="Bayer Downscaling"] 457 c [label="Optical Black Correction"] 458 d [label="Linearization"] 459 e [label="Lens Shading Correction"] 460 f [label="White Balance / Exposure / Focus Apply"] 461 g [label="Bayer Noise Reduction"] 462 h [label="ANR"] 463 i [label="Demosaicing"] 464 j [label="Color Correction Matrix"] 465 k [label="Gamma correction"] 466 l [label="Color Space Conversion"] 467 m [label="Chroma Down Scaling"] 468 n [label="Chromatic Noise Reduction"] 469 o [label="Total Color Correction"] 470 p [label="XNR3"] 471 q [label="TNR"] 472 r [label="DDR", style=filled, fillcolor=yellow, shape=cylinder] 473 s [label="YUV Downscaling"] 474 t [label="DDR", style=filled, fillcolor=yellow, shape=cylinder] 475 476 { rank=same; a -> b -> c -> d -> e -> f -> g -> h -> i } 477 { rank=same; j -> k -> l -> m -> n -> o -> p -> q -> s -> t} 478 479 a -> j [style=invis, weight=10] 480 i -> j 481 q -> r 482 } 483 484The table below presents a description of the above algorithms. 485 486======================== ======================================================= 487Name Description 488======================== ======================================================= 489Optical Black Correction Optical Black Correction block subtracts a pre-defined 490 value from the respective pixel values to obtain better 491 image quality. 492 Defined in struct ipu3_uapi_obgrid_param. 493Linearization This algo block uses linearization parameters to 494 address non-linearity sensor effects. The Lookup table 495 table is defined in 496 struct ipu3_uapi_isp_lin_vmem_params. 497SHD Lens shading correction is used to correct spatial 498 non-uniformity of the pixel response due to optical 499 lens shading. This is done by applying a different gain 500 for each pixel. The gain, black level etc are 501 configured in struct ipu3_uapi_shd_config_static. 502BNR Bayer noise reduction block removes image noise by 503 applying a bilateral filter. 504 See struct ipu3_uapi_bnr_static_config for details. 505ANR Advanced Noise Reduction is a block based algorithm 506 that performs noise reduction in the Bayer domain. The 507 convolution matrix etc can be found in 508 struct ipu3_uapi_anr_config. 509DM Demosaicing converts raw sensor data in Bayer format 510 into RGB (Red, Green, Blue) presentation. Then add 511 outputs of estimation of Y channel for following stream 512 processing by Firmware. The struct is defined as 513 struct ipu3_uapi_dm_config. 514Color Correction Color Correction algo transforms sensor specific color 515 space to the standard "sRGB" color space. This is done 516 by applying 3x3 matrix defined in 517 struct ipu3_uapi_ccm_mat_config. 518Gamma correction Gamma correction struct ipu3_uapi_gamma_config is a 519 basic non-linear tone mapping correction that is 520 applied per pixel for each pixel component. 521CSC Color space conversion transforms each pixel from the 522 RGB primary presentation to YUV (Y: brightness, 523 UV: Luminance) presentation. This is done by applying 524 a 3x3 matrix defined in 525 struct ipu3_uapi_csc_mat_config 526CDS Chroma down sampling 527 After the CSC is performed, the Chroma Down Sampling 528 is applied for a UV plane down sampling by a factor 529 of 2 in each direction for YUV 4:2:0 using a 4x2 530 configurable filter struct ipu3_uapi_cds_params. 531CHNR Chroma noise reduction 532 This block processes only the chrominance pixels and 533 performs noise reduction by cleaning the high 534 frequency noise. 535 See struct struct ipu3_uapi_yuvp1_chnr_config. 536TCC Total color correction as defined in struct 537 struct ipu3_uapi_yuvp2_tcc_static_config. 538XNR3 eXtreme Noise Reduction V3 is the third revision of 539 noise reduction algorithm used to improve image 540 quality. This removes the low frequency noise in the 541 captured image. Two related structs are being defined, 542 struct ipu3_uapi_isp_xnr3_params for ISP data memory 543 and struct ipu3_uapi_isp_xnr3_vmem_params for vector 544 memory. 545TNR Temporal Noise Reduction block compares successive 546 frames in time to remove anomalies / noise in pixel 547 values. struct ipu3_uapi_isp_tnr3_vmem_params and 548 struct ipu3_uapi_isp_tnr3_params are defined for ISP 549 vector and data memory respectively. 550======================== ======================================================= 551 552Other often encountered acronyms not listed in above table: 553 554 ACC 555 Accelerator cluster 556 AWB_FR 557 Auto white balance filter response statistics 558 BDS 559 Bayer downscaler parameters 560 CCM 561 Color correction matrix coefficients 562 IEFd 563 Image enhancement filter directed 564 Obgrid 565 Optical black level compensation 566 OSYS 567 Output system configuration 568 ROI 569 Region of interest 570 YDS 571 Y down sampling 572 YTM 573 Y-tone mapping 574 575A few stages of the pipeline will be executed by firmware running on the ISP 576processor, while many others will use a set of fixed hardware blocks also 577called accelerator cluster (ACC) to crunch pixel data and produce statistics. 578 579ACC parameters of individual algorithms, as defined by 580struct ipu3_uapi_acc_param, can be chosen to be applied by the user 581space through struct struct ipu3_uapi_flags embedded in 582struct ipu3_uapi_params structure. For parameters that are configured as 583not enabled by the user space, the corresponding structs are ignored by the 584driver, in which case the existing configuration of the algorithm will be 585preserved. 586 587References 588========== 589 590.. [#f5] drivers/staging/media/ipu3/include/uapi/intel-ipu3.h 591 592.. [#f1] https://github.com/intel/nvt 593 594.. [#f2] http://git.ideasonboard.org/yavta.git 595 596.. [#f3] http://git.ideasonboard.org/?p=media-ctl.git;a=summary 597 598.. [#f4] ImgU limitation requires an additional 16x16 for all input resolutions 599