diff options
author | Deepak Nibade <dnibade@nvidia.com> | 2018-03-16 08:24:55 -0400 |
---|---|---|
committer | mobile promotions <svcmobile_promotions@nvidia.com> | 2018-03-21 09:04:35 -0400 |
commit | 66751bc05d7a1efca3668d59a2820e3e92985f91 (patch) | |
tree | ab85f317d77c8c76d6a7430039d19d406b9eb8f5 /drivers/gpu/nvgpu/gk20a/gr_gk20a.h | |
parent | c5ca711f1efbd30fa760df139f3b63aa471d28a9 (diff) |
gpu: nvgpu: gv100: fix num_fbpas while adding ctxsw buffer entries
For LIST_nv_pm_fbpa_ctx_regs, we right now call
add_ctxsw_buffer_map_entries_subunits() to add registers corresponding
to all the FBPAs
But while configuring total number of registers, we do not consider
floorswept FBPAs and that causes misalignment in subsequent lists for GV100
Fix this by reading disabled/floorswept FBPAs from fuse and consider only those
FBPAs which are active for GV100
Add new HAL (*add_ctxsw_reg_pm_fbpa) to support this setting and define a
common HAL gr_gk20a_add_ctxsw_reg_pm_fbpa() for all chips except GV100
Define GV100 specific gr_gv100_add_ctxsw_reg_pm_fbpa() with above mentioned
implementation to consider floorsweeping
Bug 1998067
Change-Id: Id560551bb0b8142791c117b6d27864566c90b489
Signed-off-by: Deepak Nibade <dnibade@nvidia.com>
Reviewed-on: https://git-master.nvidia.com/r/1676654
Reviewed-by: svc-mobile-coverity <svc-mobile-coverity@nvidia.com>
Reviewed-by: Terje Bergstrom <tbergstrom@nvidia.com>
GVS: Gerrit_Virtual_Submit
Reviewed-by: Vijayakumar Subbu <vsubbu@nvidia.com>
Reviewed-by: mobile promotions <svcmobile_promotions@nvidia.com>
Tested-by: mobile promotions <svcmobile_promotions@nvidia.com>
Diffstat (limited to 'drivers/gpu/nvgpu/gk20a/gr_gk20a.h')
-rw-r--r-- | drivers/gpu/nvgpu/gk20a/gr_gk20a.h | 6 |
1 files changed, 6 insertions, 0 deletions
diff --git a/drivers/gpu/nvgpu/gk20a/gr_gk20a.h b/drivers/gpu/nvgpu/gk20a/gr_gk20a.h index 7f89037e..5ac363e1 100644 --- a/drivers/gpu/nvgpu/gk20a/gr_gk20a.h +++ b/drivers/gpu/nvgpu/gk20a/gr_gk20a.h | |||
@@ -811,4 +811,10 @@ u32 gk20a_init_sw_bundle(struct gk20a *g); | |||
811 | int gr_gk20a_fecs_ctx_image_save(struct channel_gk20a *c, u32 save_type); | 811 | int gr_gk20a_fecs_ctx_image_save(struct channel_gk20a *c, u32 save_type); |
812 | int gk20a_gr_handle_semaphore_pending(struct gk20a *g, | 812 | int gk20a_gr_handle_semaphore_pending(struct gk20a *g, |
813 | struct gr_gk20a_isr_data *isr_data); | 813 | struct gr_gk20a_isr_data *isr_data); |
814 | int gr_gk20a_add_ctxsw_reg_pm_fbpa(struct gk20a *g, | ||
815 | struct ctxsw_buf_offset_map_entry *map, | ||
816 | struct aiv_list_gk20a *regs, | ||
817 | u32 *count, u32 *offset, | ||
818 | u32 max_cnt, u32 base, | ||
819 | u32 num_fbpas, u32 stride, u32 mask); | ||
814 | #endif /*__GR_GK20A_H__*/ | 820 | #endif /*__GR_GK20A_H__*/ |