aboutsummaryrefslogtreecommitdiffstats
path: root/arch/sparc
diff options
context:
space:
mode:
authorRik van Riel <riel@redhat.com>2013-12-18 20:08:44 -0500
committerGreg Kroah-Hartman <gregkh@linuxfoundation.org>2014-01-09 15:24:23 -0500
commitd303cf4624824971d94b4e2c7c95df052d14aa81 (patch)
tree710211f3a52e8b0ac893dba86bbbb2afcb3df3c9 /arch/sparc
parent57f74b6ecebf59991677dd2da0f0433e8be6c945 (diff)
mm: fix TLB flush race between migration, and change_protection_range
commit 20841405940e7be0617612d521e206e4b6b325db upstream. There are a few subtle races, between change_protection_range (used by mprotect and change_prot_numa) on one side, and NUMA page migration and compaction on the other side. The basic race is that there is a time window between when the PTE gets made non-present (PROT_NONE or NUMA), and the TLB is flushed. During that time, a CPU may continue writing to the page. This is fine most of the time, however compaction or the NUMA migration code may come in, and migrate the page away. When that happens, the CPU may continue writing, through the cached translation, to what is no longer the current memory location of the process. This only affects x86, which has a somewhat optimistic pte_accessible. All other architectures appear to be safe, and will either always flush, or flush whenever there is a valid mapping, even with no permissions (SPARC). The basic race looks like this: CPU A CPU B CPU C load TLB entry make entry PTE/PMD_NUMA fault on entry read/write old page start migrating page change PTE/PMD to new page read/write old page [*] flush TLB reload TLB from new entry read/write new page lose data [*] the old page may belong to a new user at this point! The obvious fix is to flush remote TLB entries, by making sure that pte_accessible aware of the fact that PROT_NONE and PROT_NUMA memory may still be accessible if there is a TLB flush pending for the mm. This should fix both NUMA migration and compaction. [mgorman@suse.de: fix build] Signed-off-by: Rik van Riel <riel@redhat.com> Signed-off-by: Mel Gorman <mgorman@suse.de> Cc: Alex Thorlton <athorlton@sgi.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Diffstat (limited to 'arch/sparc')
-rw-r--r--arch/sparc/include/asm/pgtable_64.h4
1 files changed, 2 insertions, 2 deletions
diff --git a/arch/sparc/include/asm/pgtable_64.h b/arch/sparc/include/asm/pgtable_64.h
index 7619f2f792af..dfb0019bf05b 100644
--- a/arch/sparc/include/asm/pgtable_64.h
+++ b/arch/sparc/include/asm/pgtable_64.h
@@ -616,7 +616,7 @@ static inline unsigned long pte_present(pte_t pte)
616} 616}
617 617
618#define pte_accessible pte_accessible 618#define pte_accessible pte_accessible
619static inline unsigned long pte_accessible(pte_t a) 619static inline unsigned long pte_accessible(struct mm_struct *mm, pte_t a)
620{ 620{
621 return pte_val(a) & _PAGE_VALID; 621 return pte_val(a) & _PAGE_VALID;
622} 622}
@@ -806,7 +806,7 @@ static inline void __set_pte_at(struct mm_struct *mm, unsigned long addr,
806 * SUN4V NOTE: _PAGE_VALID is the same value in both the SUN4U 806 * SUN4V NOTE: _PAGE_VALID is the same value in both the SUN4U
807 * and SUN4V pte layout, so this inline test is fine. 807 * and SUN4V pte layout, so this inline test is fine.
808 */ 808 */
809 if (likely(mm != &init_mm) && pte_accessible(orig)) 809 if (likely(mm != &init_mm) && pte_accessible(mm, orig))
810 tlb_batch_add(mm, addr, ptep, orig, fullmm); 810 tlb_batch_add(mm, addr, ptep, orig, fullmm);
811} 811}
812 812