Codeberg CI not executing any tasks for the past days #1316
Labels
No Label
bug
Codeberg
contribution welcome
docs
duplicate
enhancement
infrastructure
legal
licence / ToS
public relations
question
s/Gitea/Forgejo
s/Pages
s/Weblate
s/Woodpecker
security
service
spam
upstream
wontfix
No Milestone
No Assignees
4 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: Codeberg/Community#1316
Loading…
Reference in New Issue
There is no content yet.
Delete Branch "%!s(<nil>)"
Deleting a branch is permanent. Although the deleted branch may exist for a short time before cleaning up, in most cases it CANNOT be undone. Continue?
Comment
I (And according to chats on matrix some others) encounter an issue right now with the Codeberg CI instance.
The issue is, that the CI does start a run, but not execute any of its set jobs... No matter how long it has been running.
As an example, a recent run I started has not yet even cloned the repository 12 hours after it has been triggered.
I would apreciate if the Codeberg team could look into this as I believe there is a backend-issue in the CI causing jobs to be queued but never run...
I've been having the same thing for a month or so; all CI triggers appear to be running on the repo but are getting back a 410 from CI. I can manually run jobs fine.
Hitting "Repair Repository" in the CI settings appears to re-create the webhook on the repo and then everything works fine. It's a bit annoying to have to do this for every repo, but for anyone finding this it should fix your builds. If that doesn't work you may have to manually remove the webhook in the repo settings then hit repair repo in CI.
I would also appreciate this to be fixed, because breaking changes like this lead to very poor user experience. But there is a known workaround, and I heard that it was related to an actual edge case / bug that the "Repair all" action failed on our CI instance.
I suppose we can close this now.
Am I to understand that this won't be fixed? I really don't want to go through every single repo (there are a lot of them) and hit the repair button. I know it sounds silly, but it would be very tedious and there must be a way to just update the webhook for affected repos? Can we at least get a description of what the issue was?
Thanks for your understanding; I'm sorry if I sound frustrated but this has been really hurting for a month now so the abrupt closure of this issue is a bit disheartening. @fnetX
Pinging @pat-s and @6543 again who are maintaining the Woodpecker CI. It is probably tracked elsewhere though …
My personal guess is, that when Codeberg updated their CI/infrastructure, it caused the currently used webhooks, or probs more the tokens, to expire and no longer be valid, since the webhook and webhook secret contain a token for validation.
I've hit the "repair all" button again on the admin interface which again failed. Unfortunately, I don't have access to the VM and the logs yet, so I can't debug further.
But maybe this already helped for some repos? (I don't know whether if does some partial work or fails for all right from the start).