drm/i915: Add SCRATCH1 and ROW_CHICKEN3 to the register whitelist.
authorFrancisco Jerez <currojerez@riseup.net>
Fri, 29 May 2015 13:44:15 +0000 (16:44 +0300)
committerJani Nikula <jani.nikula@intel.com>
Mon, 15 Jun 2015 13:00:48 +0000 (16:00 +0300)
Only bit 27 of SCRATCH1 and bit 6 of ROW_CHICKEN3 are allowed to be
set because of security-sensitive bits we don't want userspace to mess
with.  On HSW hardware the whitelisted bits control whether atomic
read-modify-write operations are performed on L3 or on GTI, and when
set to L3 (which can be 10x-30x better performing than on GTI,
depending on the application) require great care to avoid a system
hang, so we currently program them to be handled on GTI by default.

Beignet can immediately start taking advantage of this change to
enable L3 atomics.  Mesa should eventually switch to L3 atomics too,
but a number of non-trivial changes are still required so it will
continue using GTI atomics for now.

Signed-off-by: Francisco Jerez <currojerez@riseup.net>
Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Jani Nikula <jani.nikula@intel.com>
drivers/gpu/drm/i915/i915_cmd_parser.c

index cafa3e2f16fc70dd181244449a931659f23e76dd..306d9e4e5cf376bc57901d665622a8b16fdce4d9 100644 (file)
@@ -463,6 +463,13 @@ static const struct drm_i915_reg_descriptor gen7_render_regs[] = {
        REG32(GEN7_L3SQCREG1),
        REG32(GEN7_L3CNTLREG2),
        REG32(GEN7_L3CNTLREG3),
+       REG32(HSW_SCRATCH1,
+             .mask = ~HSW_SCRATCH1_L3_DATA_ATOMICS_DISABLE,
+             .value = 0),
+       REG32(HSW_ROW_CHICKEN3,
+             .mask = ~(HSW_ROW_CHICKEN3_L3_GLOBAL_ATOMICS_DISABLE << 16 |
+                        HSW_ROW_CHICKEN3_L3_GLOBAL_ATOMICS_DISABLE),
+             .value = 0),
 };
 
 static const struct drm_i915_reg_descriptor gen7_blt_regs[] = {