========================
MMC Asynchronous Request
========================

Rationale
=========

How significant is the cache maintenance overhead?

It depends. Fast eMMC and multiple cache levels with speculative cache
pre-fetch make the cache overhead relatively significant. If the DMA
preparations for the next request are done in parallel with the current
transfer, the DMA preparation overhead would not affect the MMC performance.

The intention of non-blocking (asynchronous) MMC requests is to minimize the
time between when an MMC request ends and another MMC request begins.

Using mmc_wait_for_req(), the MMC controller is idle while dma_map_sg and
dma_unmap_sg are processing. Using non-blocking MMC requests makes it
possible to prepare the caches for the next job in parallel with an active
MMC request.

MMC block driver
================

The mmc_blk_issue_rw_rq() function in the MMC block driver is made
non-blocking.
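
A simplified sketch of how such a non-blocking issue loop can interleave
preparation of the next request with the ongoing transfer (pseudocode with
hypothetical helper names, not the actual driver code)::

	while (requests remain in the block queue) {
		prev = cur;
		cur = fetch_next_request();
		/* prepare caches/DMA for cur while prev may still transfer */
		prepare(cur);			/* e.g. dma_map_sg() */
		/* waits for prev (if any) to complete, then starts cur */
		mmc_start_req(host, cur);
		if (prev)
			post_process(prev);	/* e.g. dma_unmap_sg() */
	}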

The increase in throughput is proportional to the time it takes to
prepare a request (the major part of the preparation is dma_map_sg() and
dma_unmap_sg()) and to how fast the memory is. The faster the MMC/SD is, the
more significant the prepare request time becomes. Roughly, the expected
performance gain is 5% for large writes and 10% for large reads on an L2-cache
platform. In power save mode, when clocks run at a lower frequency, the DMA
preparation may cost even more. As long as these slower preparations are run
in parallel with the transfer, performance won't be affected.

Details on measurements from IOZone and mmc_test
================================================

https://wiki.linaro.org/WorkingGroups/Kernel/Specs/StoragePerfMMC-async-req

MMC core API extension
======================

There is one new public function, mmc_start_req().

It starts a new MMC command request for a host. The function isn't
truly non-blocking. If there is an ongoing async request, it waits
for completion of that request, starts the new one, and returns; it
doesn't wait for the new request to complete.
If there is no ongoing
request, it starts the new request and returns immediately.

MMC host extensions
===================

There are two optional members in mmc_host_ops -- pre_req() and
post_req() -- that the host driver may implement in order to move work
to before and after the actual mmc_host_ops.request() function is called.

In the DMA case, pre_req() may do dma_map_sg() and prepare the DMA
descriptor, and post_req() runs the dma_unmap_sg().

Optimize for the first request
==============================

The first request in a series of requests can't be prepared in parallel
with the previous transfer, since there is no previous request.

The argument is_first_req in pre_req() indicates that there is no previous
request. The host driver may optimize for this scenario to minimize
the performance loss. A way to optimize for this is to split the current
request in two chunks, prepare the first chunk and start the request,
and finally prepare the second chunk and start the transfer.

Pseudocode to handle the is_first_req scenario with minimal prepare overhead::

  if (is_first_req && req->size > threshold)
     /* start MMC transfer for the complete transfer size */
     mmc_start_command(MMC_CMD_TRANSFER_FULL_SIZE);

     /*
      * Begin to prepare DMA while cmd is being processed by MMC.
      * The first chunk of the request should take the same time
      * to prepare as the "MMC process command time".
      * If the prepare time exceeds the MMC cmd time,
      * the transfer is delayed; guesstimate max 4k as first chunk size.
      */
     prepare_1st_chunk_for_dma(req);
     /* flush pending desc to the DMAC (dmaengine.h) */
     dma_issue_pending(req->dma_desc);

     prepare_2nd_chunk_for_dma(req);
     /*
      * The second issue_pending should be called before MMC runs out
      * of the first chunk. If the MMC runs out of the first data chunk
      * before this call, the transfer is delayed.
      */
     dma_issue_pending(req->dma_desc);
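
For reference, the pre_req()/post_req() hooks described under "MMC host
extensions" follow roughly this pattern. This is a sketch only: the helper
names and the elided dma_map_sg()/dma_unmap_sg() arguments are placeholders,
not a real host driver::

  static void my_pre_req(struct mmc_host *host, struct mmc_request *mrq,
                         bool is_first_req)
  {
          /* map the scatterlist and build the DMA descriptor up front,
           * possibly while a previous request is still transferring */
          dma_map_sg(...);
          prepare_dma_descriptor(mrq);
  }

  static void my_post_req(struct mmc_host *host, struct mmc_request *mrq,
                          int err)
  {
          /* tear down the mapping after the transfer has completed */
          dma_unmap_sg(...);
  }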