author    Peter Daifuku <pdaifuku@nvidia.com>    2020-09-30 14:25:05 -0400
committer mobile promotions <svcmobile_promotions@nvidia.com>    2020-10-06 13:10:02 -0400
commit    5a948ccca95bcecf9d1e81db02394134f8a18c38 (patch)
tree      fb9e43f6750d9c804e5eb8b161a1c634428f9914 /drivers/gpu/nvgpu/common/mm
parent    cd134bb198d7138a3c2fcb17d11f2eedf934e2c4 (diff)
gpu: nvgpu: limit PD cache to < pgsize for linux
For Linux, limit the use of the cache to entries less than the page
size, to avoid potential problems with running out of CMA memory when
allocating large, contiguous slabs, as would be required for
non-iommuable chips.

Also, in nvgpu_pd_cache_do_free(), zero out entries only if iommu is
in use and PTE entries use the cache (since it's the prefetch of
invalid PTEs by iommu that needs to be avoided).

Bug 3093183
Bug 3100907

Change-Id: I363031db32e11bc705810a7e87fc9e9ac1dc00bd
Signed-off-by: Peter Daifuku <pdaifuku@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/c/linux-nvgpu/+/2422039
Reviewed-by: automaticguardword <automaticguardword@nvidia.com>
Reviewed-by: Alex Waterman <alexw@nvidia.com>
Reviewed-by: Dinesh T <dt@nvidia.com>
Reviewed-by: Satish Arora <satisha@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
GVS: Gerrit_Virtual_Submit
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
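For context, the allocation-side policy described in the first paragraph
amounts to a size predicate like the sketch below. This is illustrative
only: the helper name is hypothetical and does not appear in the nvgpu
sources, where the check is folded into the pd_cache allocation path
(the diff in this commit only shows the matching change on the free path).

    /*
     * Illustrative sketch only: nvgpu_pd_size_is_cacheable() is a
     * hypothetical name. It expresses the policy from the commit
     * message: on Linux, serve only page-directory allocations
     * smaller than PAGE_SIZE from the PD cache, so the cache never
     * needs large, physically contiguous slabs (which could exhaust
     * CMA memory on non-iommuable chips). Larger requests are
     * allocated directly instead.
     */
    static inline bool nvgpu_pd_size_is_cacheable(u32 pd_size)
    {
            return pd_size < PAGE_SIZE;
    }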
Diffstat (limited to 'drivers/gpu/nvgpu/common/mm')
-rw-r--r--    drivers/gpu/nvgpu/common/mm/pd_cache.c    15
1 file changed, 11 insertions(+), 4 deletions(-)
diff --git a/drivers/gpu/nvgpu/common/mm/pd_cache.c b/drivers/gpu/nvgpu/common/mm/pd_cache.c
index a5b3d134..8f7003e5 100644
--- a/drivers/gpu/nvgpu/common/mm/pd_cache.c
+++ b/drivers/gpu/nvgpu/common/mm/pd_cache.c
@@ -423,12 +423,19 @@ static void nvgpu_pd_cache_do_free(struct gk20a *g,
 	 * this just re-adds it.
 	 *
 	 * Since the memory used for the entries is still mapped, if
-	 * igpu make sure the entries are invalidated so that the hw
-	 * doesn't accidentally try to prefetch non-existent fb memory.
+	 * iommu is being used, make sure PTE entries in particular
+	 * are invalidated so that the hw doesn't accidentally try to
+	 * prefetch non-existent fb memory.
 	 *
-	 * TBD: what about dgpu? (Not supported in Drive 5.0)
+	 * Notes:
+	 * - The check for NVGPU_PD_CACHE_SIZE > PAGE_SIZE effectively
+	 *   determines whether PTE entries use the cache.
+	 * - In the case where PTE entries use the cache, we also
+	 *   end up invalidating the PDE entries, but that's a minor
+	 *   performance hit, as there are far fewer of those
+	 *   typically than there are PTE entries.
 	 */
-	if (pd->mem->cpu_va != NULL) {
+	if (nvgpu_iommuable(g) && (NVGPU_PD_CACHE_SIZE > PAGE_SIZE)) {
 		memset((void *)((u64)pd->mem->cpu_va + pd->mem_offs), 0,
 			pentry->pd_size);
 	}