Skip to content

Commit

Permalink
ocfs2: o2hb: don't negotiate if last hb fail
Browse files Browse the repository at this point in the history
Sometimes io error is returned when storage is down for a while.  Like
for iscsi device, stroage is made offline when session timeout, and this
will make all io return -EIO.  For this case, nodes shouldn't do
negotiate timeout but should fence self.  So let nodes fence self when
o2hb_do_disk_heartbeat return an error, this is the same behavior with
o2hb without negotiate timer.

Signed-off-by: Junxiao Bi <[email protected]>
Reviewed-by: Ryan Ding <[email protected]>
Reviewed-by: Mark Fasheh <[email protected]>
Cc: Gang He <[email protected]>
Cc: rwxybh <[email protected]>
Cc: Joel Becker <[email protected]>
Cc: Joseph Qi <[email protected]>
Signed-off-by: Andrew Morton <[email protected]>
Signed-off-by: Linus Torvalds <[email protected]>
  • Loading branch information
biger410 authored and torvalds committed May 27, 2016
1 parent 1bd1290 commit 88dbe98
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions fs/ocfs2/cluster/heartbeat.c
Original file line number Diff line number Diff line change
Expand Up @@ -284,6 +284,9 @@ struct o2hb_region {
/* Message key for negotiate timeout message. */
unsigned int hr_key;
struct list_head hr_handler_list;

/* last hb status, 0 for success, other value for error. */
int hr_last_hb_status;
};

struct o2hb_bio_wait_ctxt {
Expand Down Expand Up @@ -395,6 +398,12 @@ static void o2hb_nego_timeout(struct work_struct *work)
struct o2hb_region *reg;

reg = container_of(work, struct o2hb_region, hr_nego_timeout_work.work);
/* don't negotiate timeout if last hb failed since it is very
* possible io failed. Should let write timeout fence self.
*/
if (reg->hr_last_hb_status)
return;

o2hb_fill_node_map(live_node_bitmap, sizeof(live_node_bitmap));
/* lowest node as master node to make negotiate decision. */
master_node = find_next_bit(live_node_bitmap, O2NM_MAX_NODES, 0);
Expand Down Expand Up @@ -1228,6 +1237,7 @@ static int o2hb_thread(void *data)
before_hb = ktime_get_real();

ret = o2hb_do_disk_heartbeat(reg);
reg->hr_last_hb_status = ret;

after_hb = ktime_get_real();

Expand Down

0 comments on commit 88dbe98

Please sign in to comment.