Actions: NVIDIA-NeMo/Curator

Create PR to main with cherry-pick from release

405 workflow runs

ci: Optimize docker layer and uv with no cache (#1444)
Create PR to main with cherry-pick from release #423: Commit 9f28a59 pushed by thomasdhc
16s main
ci: Address setuptools CVE (#1438)
Create PR to main with cherry-pick from release #422: Commit d3bb54a pushed by thomasdhc
22s main
standardize ID field names across deduplication workflows (#1390)
Create PR to main with cherry-pick from release #421: Commit f5846be pushed by sarahyurick
12s main
[benchmark] Add Video Benchmarks (#1430)
Create PR to main with cherry-pick from release #420: Commit c022a7e pushed by praateekmahajan
11s main
ci: Enable AWS runners (#1388)
Create PR to main with cherry-pick from release #419: Commit d5ef575 pushed by chtruong814
11s main
Exact dedup identification benchmark (#1400)
Create PR to main with cherry-pick from release #418: Commit 4cdf6c7 pushed by praateekmahajan
9s main
[benchmarking] Add Semantic Deduplication Identification (#1410)
Create PR to main with cherry-pick from release #417: Commit 983f11e pushed by praateekmahajan
11s main
Ray Pool Executor (#1415)
Create PR to main with cherry-pick from release #416: Commit 3974061 pushed by lbliii
15s main
Fix vllm API compatibility with Video Pipeline + Upgrade vLLM to 0.14…
Create PR to main with cherry-pick from release #415: Commit 605321b pushed by suiyoubi
14s main
Clarify instructions for downloading the Llama Nemotron Post-Training…
Create PR to main with cherry-pick from release #414: Commit 4d86ee1 pushed by sarahyurick
31s main
Bump pyasn1 from 0.6.1 to 0.6.2 (#1396)
Create PR to main with cherry-pick from release #413: Commit 86c9976 pushed by ayushdg
15s main
[benchmarking] Bug fixes and UX improvements (#1409)
Create PR to main with cherry-pick from release #412: Commit 9735c69 pushed by praateekmahajan
10s main
[tests] Improve speed for SemDedup unit tests #1412
Create PR to main with cherry-pick from release #411: Commit 6348b44 pushed by praateekmahajan
13s main
Add benchmarking for modifiers (#1407)
Create PR to main with cherry-pick from release #410: Commit d177aaa pushed by sarahyurick
11s main
Increases dedup_removal timeout for raydata to 1100s (#1406)
Create PR to main with cherry-pick from release #409: Commit 1679597 pushed by praateekmahajan
10s main
Fixed MegatronTokenizerWriter to download just the tokenizer files …
Create PR to main with cherry-pick from release #408: Commit 723fbd3 pushed by sarahyurick
12s main
Update instructions for AWS credentials in ArXiv download and extract…
Create PR to main with cherry-pick from release #407: Commit 058fa93 pushed by sarahyurick
23s main
Revert "Remove nvenc/dec for xenna 0.1.6 (#1202)" (#1374)
Create PR to main with cherry-pick from release #406: Commit ac77f88 pushed by thomasdhc
16s main
[benchmarking] Update metrics to track for text (#1386)
Create PR to main with cherry-pick from release #405: Commit 3fbb0b1 pushed by praateekmahajan
11s main
Add metrics for ScoreFilter benchmarks (#1385)
Create PR to main with cherry-pick from release #404: Commit 2ac59c8 pushed by sarahyurick
9s main
Add warning for small n_clusters in SemanticDeduplicationWorkflow (#1…
Create PR to main with cherry-pick from release #402: Commit 1ba3e20 pushed by sarahyurick
16s main
Clean up benchmarking scripts (#1382)
Create PR to main with cherry-pick from release #401: Commit 37b1b63 pushed by sarahyurick
9s main
Address aiohttp and urllib3 cve (#1379)
Create PR to main with cherry-pick from release #400: Commit f8e6f79 pushed by ayushdg
18s main
Add benchmarking for ScoreFilter (#1373)
Create PR to main with cherry-pick from release #399: Commit e4fe2a7 pushed by sarahyurick
15s main