-
Notifications
You must be signed in to change notification settings - Fork 51
Validate and update links (STF-557) #387
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,32 @@ | ||
| name: Links | ||
|
|
||
| on: | ||
| push: | ||
| pull_request: | ||
| schedule: | ||
| - cron: "0 13 * * 1" # weekly, to catch external link rot without a commit | ||
| workflow_dispatch: | ||
|
|
||
| permissions: | ||
| contents: read | ||
|
|
||
| jobs: | ||
| linkChecker: | ||
| runs-on: ubuntu-latest | ||
| steps: | ||
| - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2 | ||
| with: | ||
| persist-credentials: false | ||
|
|
||
| - name: Setup mise | ||
| uses: jdx/mise-action@6d1e696aa24c1aa1bcc1adea0212707c71ab78a8 # v3.6.1 | ||
| with: | ||
| install: false | ||
|
|
||
| # Install only lychee (not the repo's full toolchain) and run the check. | ||
| - name: Check links | ||
| env: | ||
| MISE_AUTO_INSTALL: "false" | ||
| run: | | ||
| mise install lychee | ||
| mise run check-links |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -23,3 +23,4 @@ target | |
| /sample/run.sh | ||
| reports | ||
| Test.java | ||
| .lycheecache | ||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,63 @@ | ||
| # Lychee link checker configuration | ||
| # https://lychee.cli.rs/#/usage/config | ||
| # | ||
| # Run locally with: | ||
| # lychee './**/*.md' './src/**/*.java' './pom.xml' | ||
|
|
||
| # Include URL fragments in checks | ||
| include_fragments = true | ||
|
|
||
| # Don't allow any redirects, so links that have moved are surfaced and updated | ||
| # to their canonical destination. | ||
| max_redirects = 0 | ||
|
|
||
| # Accept these HTTP status codes | ||
| # 100-103: Informational responses | ||
| # 200-299: Success responses | ||
| # 403: Forbidden (some sites use this for rate limiting) | ||
| # 429: Too Many Requests | ||
| # 500-599: Server errors (temporary issues shouldn't fail CI) | ||
| # 999: LinkedIn's custom status code | ||
| accept = ["100..=103", "200..=299", "403", "429", "500..=599", "999"] | ||
|
|
||
| # Exclude URL patterns from checking (treated as regular expressions) | ||
| exclude = [ | ||
| '^file://', | ||
| # Live / auth-gated endpoints that appear as string literals or require login | ||
| '^https://geoip\.maxmind\.com', | ||
| '^https://geolite\.info', | ||
| '^https://minfraud\.maxmind\.com', | ||
| '^https://sandbox\.maxmind\.com', | ||
| '^https://updates\.maxmind\.com', | ||
| '^https://www\.maxmind\.com/en/accounts/', | ||
| 'https://www\.maxmind\.com/en/account/login', | ||
| # XML namespace identifiers in pom.xml (not real links) | ||
| '^http://www\.w3\.org/', | ||
| '^http://maven\.apache\.org/', | ||
| '^https://maven\.apache\.org/xsd/', | ||
| '^http://java\.sun\.com/', | ||
| '^http://schemas\.', | ||
| # Maven property placeholder in a build-time download URL (not a real link) | ||
| 'japicmp\.baselineVersion', | ||
| # Placeholders / local | ||
| '^https?://example\.(com|org|net)', | ||
| '^http://localhost', | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Member
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
— Claude (posted on Greg's behalf) |
||
| '127\.0\.0\.1', | ||
| ] | ||
|
|
||
| # Exclude file paths from getting checked (treated as regular expressions) | ||
| exclude_path = [ | ||
| '(^|/)node_modules/', | ||
| '(^|/)target/', | ||
| # Test fixtures (MaxMind-DB submodule) contain example URLs, not ours | ||
| '(^|/)src/test/resources/', | ||
| # Changelog: historical entries are preserved as-is, not rewritten | ||
| '(^|/)CHANGELOG\.md$', | ||
| ] | ||
|
|
||
| # Cache results for 1 day to speed up repeated checks | ||
| cache = true | ||
| max_cache_age = "1d" | ||
|
|
||
| # Skip missing input files instead of erroring | ||
| skip_missing = true | ||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For consistency with the other URL patterns in the
excludelist, and to prevent accidental partial matches, consider anchoring this pattern with^.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These
excludeentries are matched against full URLs, so a leading^is effectively a no-op here. Left as-is to match the dev-site/blog-site config style.— Claude (posted on Greg's behalf)