md/raid6: Set R5_ReadError when there is read failure on parity disk
authorXiao Ni <xni@redhat.com>
Mon, 8 Jul 2019 02:14:32 +0000 (10:14 +0800)
committerSong Liu <songliubraving@fb.com>
Wed, 7 Aug 2019 17:25:02 +0000 (10:25 -0700)
7471fb77ce4d ("md/raid6: Fix anomily when recovering a single device in
RAID6.") avoids rereading P when it can be computed from other members.
However, this misses the chance to re-write the right data to P. This
patch sets R5_ReadError if the re-read fails.

Also, when re-read is skipped, we also missed the chance to reset
rdev->read_errors to 0. It can fail the disk when there are many read
errors on P member disk (other disks don't have read error)

V2: upper layer read request don't read parity/Q data. So there is no
need to consider such situation.

This is Reported-by: kbuild test robot <lkp@intel.com>

Fixes: 7471fb77ce4d ("md/raid6: Fix anomily when recovering a single device in RAID6.")
Cc: <stable@vger.kernel.org> #4.4+
Signed-off-by: Xiao Ni <xni@redhat.com>
Signed-off-by: Song Liu <songliubraving@fb.com>
drivers/md/raid5.c

index 3de4e13bde984553ac5571a690e93f43ffad5bdf..59cafafd5a5dc14c4917e9e8aaba23c0432b5b4f 100644 (file)
@@ -2558,7 +2558,9 @@ static void raid5_end_read_request(struct bio * bi)
                    && !test_bit(R5_ReadNoMerge, &sh->dev[i].flags))
                        retry = 1;
                if (retry)
-                       if (test_bit(R5_ReadNoMerge, &sh->dev[i].flags)) {
+                       if (sh->qd_idx >= 0 && sh->pd_idx == i)
+                               set_bit(R5_ReadError, &sh->dev[i].flags);
+                       else if (test_bit(R5_ReadNoMerge, &sh->dev[i].flags)) {
                                set_bit(R5_ReadError, &sh->dev[i].flags);
                                clear_bit(R5_ReadNoMerge, &sh->dev[i].flags);
                        } else