@liuxuezhao
Contributor

@liuxuezhao liuxuezhao commented Dec 29, 2025

1. Fix a bug where ec_agg_boundary was used before checking whether it is valid.
2. Add more logs for the case where a rebuild fetch gets a zero iod_size,
   to provide hints about the layout information.
3. Fix a REINT bug where the rebuild could complete before the
   reint engines' pulls completed.
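
As a rough illustration of item 1, the pattern is to test the validity of the boundary before reading it. This is a minimal sketch, not the DAOS code: `struct ec_agg_info`, its `boundary_valid` flag, and `ec_agg_boundary_get()` are hypothetical names invented for this example.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the state that carries the EC aggregation boundary. */
struct ec_agg_info {
	uint64_t boundary;       /* meaningful only when boundary_valid is true */
	bool     boundary_valid; /* assumed validity flag, not a real DAOS field */
};

/*
 * Store the boundary in *out and return 0 only when it is valid.
 * Return -1 and leave *out untouched otherwise, so callers cannot
 * consume a stale or uninitialized boundary by accident.
 */
static int
ec_agg_boundary_get(const struct ec_agg_info *info, uint64_t *out)
{
	if (info == NULL || out == NULL || !info->boundary_valid)
		return -1;

	*out = info->boundary;
	return 0;
}
```

A caller would check the return value first and fall back to its existing behavior when the boundary is not valid.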

 Signed-off-by: Xuezhao Liu <xuezhao.liu@hpe.com>

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

@liuxuezhao liuxuezhao requested review from a team as code owners December 29, 2025 09:18
@github-actions

Ticket title is 'Data corruption observed with master branch under MDonSSD environment.'
Status is 'In Progress'
Labels: 'md_on_ssd,scrubbed_2.8'
https://daosio.atlassian.net/browse/DAOS-18368

@daosbuild3
Collaborator

Test stage NLT on EL 8.8 completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos/job/PR-17324/2/display/redirect

NiuYawei previously approved these changes Dec 30, 2025
@liuxuezhao liuxuezhao force-pushed the lxz/rb_fix_ec_agg_eph branch from 10fa58e to 66f44a9 Compare December 31, 2025 06:06
@daosbuild3
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17324/4/execution/node/1313/log

@daosbuild3
Collaborator

Test stage Functional Hardware Medium Verbs Provider MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17324/4/execution/node/1323/log

@liuxuezhao
Contributor Author

Just refreshed the patch to change a few logs.

@liuxuezhao liuxuezhao requested a review from NiuYawei January 4, 2026 10:47
@liuxuezhao liuxuezhao force-pushed the lxz/rb_fix_ec_agg_eph branch 2 times, most recently from 32db84f to e1c08ea Compare January 4, 2026 11:00
NiuYawei previously approved these changes Jan 5, 2026
wangshilong previously approved these changes Jan 5, 2026
1. Fix a bug where ec_agg_boundary was used before checking whether it is valid.
2. Add more logs for the case where a rebuild fetch gets a zero iod_size,
   to provide hints about the layout information.

 Signed-off-by: Xuezhao Liu <xuezhao.liu@hpe.com>
The rebuild leader could treat a reint engine as complete before its
pull is DONE.
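
The intended invariant can be sketched as follows: the leader declares the rebuild complete only after every engine, including the ones being reintegrated, has reported its pull as done. This is a simplified illustration under assumed names. `struct engine_status` and `rebuild_all_done()` are hypothetical and do not reflect the actual leader-side bookkeeping.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical per-engine status as tracked by the rebuild leader. */
struct engine_status {
	bool scan_done; /* engine finished scanning */
	bool pull_done; /* engine finished pulling its data */
};

/*
 * The rebuild counts as complete only when every engine has finished
 * both phases. An engine whose scan is done but whose pull is still
 * in flight keeps the whole rebuild open.
 */
static bool
rebuild_all_done(const struct engine_status *engines, size_t nr)
{
	for (size_t i = 0; i < nr; i++) {
		if (!engines[i].scan_done || !engines[i].pull_done)
			return false;
	}
	return true;
}
```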

Signed-off-by: Xuezhao Liu <xuezhao.liu@hpe.com>
@liuxuezhao liuxuezhao dismissed stale reviews from wangshilong and NiuYawei via c474fe4 January 7, 2026 07:31
5 participants