Skip to content

Download and process wiki data for specific category or topic using meta data info #1408

@anushaknvidia

Description

@anushaknvidia

I know NeMo Curator can download a full Wikipedia dump (all articles, not just a single topic’s “root pages”), and you can later filter it to the topics you care about using meta data. Is there a way to filter for a topic and related pages while downloading/processing taking help from meta data?

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions