feat: add tag-based filtering to dataset hub search #708
Open
Felipedino wants to merge 3 commits into
Adds a chip-based tag input to the hub grid so users can AND-filter datasets by tag across HuggingFace, OpenML, and Zenodo.

- Backend: new `tags` query param (repeated) on the search endpoint; each source applies it natively (HF `filter=`, OpenML `tag=`, Zenodo `keywords:"…"` Lucene clause)
- Frontend: tag chip UI (Enter/comma to add, click X to remove); Axios `paramsSerializer` fixed to emit repeated params instead of comma-joining, preventing breakage on tags that contain commas
- OpenML pagination uses limit+1 sentinel pattern to avoid premature `next_cursor=None` when a tag filter reduces page size
- Zenodo escapes `\` and `"` inside Lucene quoted strings
- OpenML multi-tag drop now logged at WARNING level

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
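For reviewers, here is a rough sketch of two of the backend behaviors described above: escaping `\` and `"` inside a quoted Lucene phrase, and the limit+1 sentinel for pagination. It is not the code in this PR. The function names (`escape_lucene_phrase`, `zenodo_keywords_clause`, `fetch_page`), the offset-based cursor, and joining the per-tag clauses with `AND` are all assumptions made for illustration.

```python
from typing import Callable, List, Optional, Sequence, Tuple


def escape_lucene_phrase(value: str) -> str:
    """Escape backslash and double quote for use inside a quoted Lucene phrase."""
    # Backslashes go first so the backslashes added for quotes are not doubled.
    return value.replace("\\", "\\\\").replace('"', '\\"')


def zenodo_keywords_clause(tags: Sequence[str]) -> str:
    """Build one keywords:"..." clause per tag (hypothetical helper).

    Joining with AND is an assumption based on the AND-filter semantics
    stated in the description.
    """
    return " AND ".join(f'keywords:"{escape_lucene_phrase(t)}"' for t in tags)


def fetch_page(
    fetch: Callable[[int, int], List[dict]],
    offset: int,
    limit: int,
) -> Tuple[List[dict], Optional[int]]:
    """Fetch one page using the limit+1 sentinel pattern (hypothetical helper).

    `fetch(offset, n)` is assumed to return up to `n` rows starting at
    `offset`. Asking for one extra row tells us whether another page exists
    without relying on the current page being full.
    """
    rows = fetch(offset, limit + 1)
    has_more = len(rows) > limit
    # Only hand out a cursor when the sentinel row proves there is more data.
    next_cursor = offset + limit if has_more else None
    return rows[:limit], next_cursor
```

With this shape, a short page does not by itself end pagination; only the absence of the extra sentinel row yields `next_cursor=None`.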