Wikidata:Requests for permissions/Bot/RPI2026F1Bot 4
- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved--Ymblanter (talk) 20:08, 4 March 2023 (UTC)[reply]
RPI2026F1Bot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: RPI2026F1 (talk • contribs • logs)
Task/s: Reconstruct npm dependencies
Code: WIP
Function details: For any package with a npm package (P8262), it will get that package's metadata and inspect it's dependencies. Any missing dependency items will be added. If that dependency does not exist on Wikidata, an item will be created for it since if a package is notable enough to be on Wikidata it's likely that it's dependents become notable by extension. The bot will not touch development dependencies but will touch peer dependencies. --RPI2026F1 (talk) 15:05, 6 January 2023 (UTC)[reply]
- Do you have an estimate of how many items this will ultimately create? Can we maybe restrict this to packages with at least a certain number of downloads. BrokenSegue (talk) 17:09, 6 January 2023 (UTC)[reply]
- I'm going to write the initial code soon. It should be trivial to restrict it to something like 100k downloads per week. RPI2026F1 (talk) 19:56, 6 January 2023 (UTC)[reply]
- that sounds like a good number. i'd still be curious how many items we're talking about though. I assume <5000? BrokenSegue (talk) 21:47, 6 January 2023 (UTC)[reply]
- I'll see once I write the code and do a dry run on testwikidata. RPI2026F1 (talk) 01:58, 7 January 2023 (UTC)[reply]
- So I did one run on npm package
vuepress
on testwikidata and did it with no downloads filter, and it created 827 items. RPI2026F1 (talk) 00:24, 8 January 2023 (UTC)[reply]- I did a run with the 100k downloads filter and it created 652 items. I think for the initial batch I should run it in a mode where no items are created, and it just completes missing links for now. I think the problem with dependencies is that you can add one brand new dependency to a package with 5 million downloads per day and just like that the dependency also has 5 million downloads per day. RPI2026F1 (talk) 01:16, 8 January 2023 (UTC)[reply]
- Support sounds good. generally i just think we should be careful about importing a ton of new items. 652 items isn't bad at al. BrokenSegue (talk) 06:44, 8 January 2023 (UTC)[reply]
- I don't think we need to add too many items. A lot of those 652 items are in fact foundational packages that exist in packages everywhere. There's just one problem, and that's types packages. I have filtered out the entire
@types
namespace per a RfD and done another rudimentary check to not add namespaced packages where the package name is justtypes
, but there's no good way of knowing if a package is a type package without human intervention. RPI2026F1 (talk) 17:25, 8 January 2023 (UTC)[reply]- I do suppose that we should import any (direct or indirect) dependencies of packages with existing items. GZWDer (talk) 15:34, 9 January 2023 (UTC)[reply]
- To show the initial impact I'll do a test run on vuepress and all of its dependencies. RPI2026F1 (talk) 01:27, 18 January 2023 (UTC)[reply]
- I do suppose that we should import any (direct or indirect) dependencies of packages with existing items. GZWDer (talk) 15:34, 9 January 2023 (UTC)[reply]
- I don't think we need to add too many items. A lot of those 652 items are in fact foundational packages that exist in packages everywhere. There's just one problem, and that's types packages. I have filtered out the entire
- Support sounds good. generally i just think we should be careful about importing a ton of new items. 652 items isn't bad at al. BrokenSegue (talk) 06:44, 8 January 2023 (UTC)[reply]
- I did a run with the 100k downloads filter and it created 652 items. I think for the initial batch I should run it in a mode where no items are created, and it just completes missing links for now. I think the problem with dependencies is that you can add one brand new dependency to a package with 5 million downloads per day and just like that the dependency also has 5 million downloads per day. RPI2026F1 (talk) 01:16, 8 January 2023 (UTC)[reply]
- that sounds like a good number. i'd still be curious how many items we're talking about though. I assume <5000? BrokenSegue (talk) 21:47, 6 January 2023 (UTC)[reply]
- I'm going to write the initial code soon. It should be trivial to restrict it to something like 100k downloads per week. RPI2026F1 (talk) 19:56, 6 January 2023 (UTC)[reply]
I'd like a status update on this proposal please. As soon as I can get an approval I will run the bot. RPI2026F1 (talk) 00:35, 28 February 2023 (UTC)[reply]
- there's generally a backlog on this getting approved. I'm comfortable with you self-closing this as passed given it's been over a month and only 2 people have commented. BrokenSegue (talk) 02:02, 28 February 2023 (UTC)[reply]
- I am waiting for the test run as promised. Ymblanter (talk) 19:19, 1 March 2023 (UTC)[reply]
- I have made multiple test runs. The latest run can be seen here. RPI2026F1 (talk) 00:42, 3 March 2023 (UTC)[reply]
- I am waiting for the test run as promised. Ymblanter (talk) 19:19, 1 March 2023 (UTC)[reply]