User talk:InternetArchiveBot

From Meta, a Wikimedia project coordination wiki
(Redirected from Talk:InternetArchiveBot/Problem)



Connect with the developers and other users

Telegram IRC (irc.libera.chat #iabot)

Operation status

For the most up-to-date information, see the run pages or the Wiki Operations Summary on Airtable.

  • 🟢 InternetArchiveBot is currently running on 300+ Wikimedia wikis.
  • 🟢 We have moved the management interface to a new server. Please start using iabot.wmcloud.org instead of iabot.toolforge.org. Please let us know if anything broke during this process.
  • 🟡 Testing is stalled on Alemannisch Wikipedia (als), Asturian Wikipedia (ast), and Japanese Wikipedia (ja).
  • 🔴 Bot is approved but disabled indefinitely on Dutch Wikipedia.
  • 🔴 Bot is approved but disabled indefinitely pending software improvements on French Wikipedia (fr), MediaWiki.org, Norwegian Nynorsk Wikipedia (nn), Polish Wikipedia (pl), and Portuguese Wikipedia (pt).

Last updated: 18:38, 10 May 2024 (UTC)

How this page works

  1. Ask your question in any language. Questions in English or German will receive the fastest responses.
  2. Our team will try to respond within seven days.
  3. Seven days after our response we will mark the thread as resolved. This queues the thread for archiving.
    If our response does not answer your question, you are welcome to remove the "section resolved" tag and write an additional comment.
  4. Seven days after the thread is marked as resolved, it will be archived. Once a thread is archived, it should not be un-archived. Instead, create a new thread and link to the old one.


SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days.


Inaccessible link

IABot “fixed” a link that it reported as inaccessible here: https://nl.wikipedia.org/w/index.php?title=Chora_%28Patmos%29&diff=66511300&oldid=66454976

However, the link works fine with http on my end. Now I do agree that https is safer (although in this case it was hardly an improvement), but that's no reason to treat a link as “inaccessible”. Mondo (talk) 18:15, 13 December 2023 (UTC)Reply

Hello Mondo. The bot did not necessarily declare the link inaccessible, though the edit summary would indicate that because the bot's edit summaries are very imprecise. The bot upgrades HTTP links to HTTPS where possible, separately from its process of fixing dead links. Harej (talk) 18:20, 13 December 2023 (UTC)Reply
Hello Harej, in that case, it's against the guidelines of the Dutch Wikipedia. We have the guideline “bij twijfel niet inhalen”, which is similar to the one on EN:WP called If it ain't broke, don't fix it, except that ours is much more detailed. The link was not broken and https hardly made a difference with this specific link, therefore it was against the guideline. I have reverted IABot and added the article to the deny list, but I hope this can be fixed, because this will happen again on other pages. Mondo (talk) 18:23, 13 December 2023 (UTC)Reply

It's not resolved. I explained what the issue was back in December and nothing has changed. Mondo (talk) 19:27, 3 April 2024 (UTC)Reply

Mondo, as explained above, it is our practice to replace HTTP with HTTPS on all wikis, and we are not changing that. Continuing to remove the "section resolved" template will not change this. If changing HTTP to HTTPS is in fact against policy, please cite the policy. Harej (talk) 20:09, 3 April 2024 (UTC)Reply
If you wanted me to cite the policy, it would've been nice to know that when I posted my last comment instead of not responding to me for months. But here you go:

https://nl.wikipedia.org/wiki/Wikipedia:Bij_twijfel_niet_inhalen
“De ene goede variant door de andere goede variant vervangen is geen verbetering of verslechtering, maar een neutrale bewerking. Dergelijke bewerkingen zijn ongewenst”

Which translates to: “Replacing one good variant with another good variant is neither an improvement nor a deterioration, but a neutral edit. Such edits are undesirable.”

Replacing http with https is exactly that: http works fine, i.e. it's a good variant, which makes it against policy. Now I could see it being somewhat useful if it's a URL where security is of the utmost importance, but in this case it's a link to a spreadsheet file. There's nothing that https will do to protect the user in this case. (Or if the http link was dead and replaced with https.) Mondo (talk) 20:18, 3 April 2024 (UTC)Reply
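
Note: for readers following this thread, the HTTP-to-HTTPS upgrade described above can be sketched roughly as follows. This is a minimal illustration in Python, not IABot's actual code. The function name upgrade_to_https and the https_works flag are hypothetical; the thread does not say how the bot decides that an upgrade is "possible".

```python
from urllib.parse import urlsplit


def upgrade_to_https(url: str, https_works: bool) -> str:
    """Return url with its scheme switched to https when that is known to work.

    https_works is a hypothetical flag supplied by the caller; how the real
    bot determines this is not described in this thread.
    """
    parts = urlsplit(url)
    if parts.scheme != "http" or not https_works:
        # Leave non-HTTP links, and links without working HTTPS, untouched.
        return url
    return parts._replace(scheme="https").geturl()


print(upgrade_to_https("http://example.org/data.xls", True))
```

The point of the sketch is that the scheme change is independent of any dead-link check: a link can be live over HTTP and still be rewritten.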

DOI

For some reason, the bot reported an error in a DOI link (here). The link is http://dx.doi.org/10.2307/597203 and it currently works fine (it redirects me to JSTOR). פעמי-עליון (talk) 11:34, 30 March 2024 (UTC)Reply

Sorry, but I don't see what you are referring to. Please give me a link to a faulty edit that I can review. —CYBERPOWER (Chat) 21:36, 3 April 2024 (UTC)Reply
The bot reported in this edit that the DOI link has an error (this is obviously not true; DOI links are very stable). I thought you might want to know about it and find the source of this mistake. פעמי-עליון (talk) 19:58, 4 April 2024 (UTC)Reply
פעמי-עליון, thank you for the report. The URL in question couldn't be found in our URL database (where links that are checked would be found), so I suppose this was a one-off situation. Please let me know if you see anything like this anywhere else. Harej (talk) 20:35, 10 April 2024 (UTC)Reply
Here as well, two links that are fine. Maybe the problem is with academic papers that are not open-access? פעמי-עליון (talk) 17:19, 14 April 2024 (UTC)Reply
If they aren't open access, they should ideally be marked as such. Not only will that inform readers that the source is not readily accessible, but the bot also handles such cases differently and, in certain situations, doesn't outright mark them as dead. —CYBERPOWER (Chat) 14:36, 15 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:25, 29 May 2024 (UTC)Reply

Your tool docs are featured as an example in the new Tool Docs Guide!

Hello InternetArchiveBot maintainers, contributors, and fans! I wanted to let you know that I highlighted the InternetArchiveBot documentation as a shining example in the new Tool Docs guide that I just published. Thank you for creating lovely tool documentation that can serve as an example to help others create and improve tool docs :-) This guide was created as part of the Doc Your Tool project for the upcoming 2024 Hackathon. If you're interested, please join that project to work on or talk about tool documentation during the hackathon! TBurmeister (WMF) (talk) 16:52, 16 April 2024 (UTC)Reply

That's exciting to hear, thank you! Harej (talk) 21:59, 24 April 2024 (UTC)Reply

Tool links in older messages broken?

For example, at https://en.wikipedia.org/wiki/Talk:Thornapple_River#External_links_modified, following the last two links gives a 404. I think these links are in a template, but I didn't track down exactly where they are sourced. ++Lar: t/c 09:43, 27 April 2024 (UTC)Reply

@Harej: do you think you can amend the redirect on Toolforge to remap old links from iabot.toolforge.org/iabot to iabot.wmcloud.org? —CYBERPOWER (Chat) 14:46, 15 May 2024 (UTC)Reply
Looks like it was accidentally broken by GreenC when updating the URL in w:Template:Source check. --Nintendofan885T&Cs apply 08:40, 17 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:26, 29 May 2024 (UTC)Reply

The bot keeps adding an archive link where it isn't required

Hello, the bot keeps trying to add this link, but it isn't needed. It has happened more than three times, and I had to revert the change every time. https://web.archive.org/web/20211012034604/https://incubator.wikimedia.org/w/index.php?hidebots=1&translations=filter&hidecategorization=1&hideWikibase=1&limit=50&days=3&title=Special%3ARecentChanges&testwiki=wp%2Fryu&urlversion=2

The unwanted modifications occur on this page: https://incubator.wikimedia.org/wiki/Wp/ryu/%E3%83%A1%E3%82%A4%E3%83%B3%E3%83%9A%E3%83%BC%E3%82%B8

And this is an example of the unwanted modification. https://incubator.wikimedia.org/w/index.php?title=Wp/ryu/%E3%83%A1%E3%82%A4%E3%83%B3%E3%83%9A%E3%83%BC%E3%82%B8&diff=prev&oldid=6254326 Patronus95 (talk) 04:51, 2 May 2024 (UTC)Reply

I just looked and this seems to have fixed itself. Is there anything more you need me to look at? —CYBERPOWER (Chat) 14:54, 15 May 2024 (UTC)Reply

Spanish

On Spanish Wikipedia, when the bot makes changes in edits like this one, could it use urlmuerta instead of deadurl? Thank you. Vanbasten 23 (talk) 08:07, 2 May 2024 (UTC)Reply

@Vanbasten 23: IABot reads from this page to get its data on how to handle citation templates. You will need to swap those values in the settings, as the bot always defaults to the first one in the list. Once adjusted, the bot will change its behavior automatically. —CYBERPOWER (Chat) 14:57, 15 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:26, 29 May 2024 (UTC)Reply

Not archiving specific sources

Hi! Is there a way to add a flag on specific sources so they don't update the archive? There are some citations which are used to cite ongoing information, so the archives are always going to be wrong, if we could add something into the template to stop IABot archiving those specific sources that would be fantastic. Best Wishes, Lee Vilenski (talkcontribs) 09:11, 2 May 2024 (UTC)Reply

@Lee Vilenski: You can actually append {{cbignore}} to the citation in question. It's an invisible template that only serves to tell the bot to keep off a specific reference. —CYBERPOWER (Chat) 14:59, 15 May 2024 (UTC)Reply
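
Note: as a rough illustration of the advice above, here is a minimal Python sketch that appends {{cbignore}} directly after a given citation in a page's wikitext. The helper name append_cbignore and the sample citation are hypothetical, and placing the tag immediately after the citation template is an assumption; check the template's documentation on your wiki for the exact placement it expects.

```python
CBIGNORE = "{{cbignore}}"


def append_cbignore(wikitext: str, citation: str) -> str:
    """Append {{cbignore}} right after the first occurrence of citation.

    Hypothetical helper for illustration; returns the text unchanged if the
    citation is not found or is already followed by the tag.
    """
    start = wikitext.find(citation)
    if start == -1:
        return wikitext
    end = start + len(citation)
    if wikitext.startswith(CBIGNORE, end):
        return wikitext  # already tagged, nothing to do
    return wikitext[:end] + CBIGNORE + wikitext[end:]


# Sample citation invented for this example.
cite = "{{cite web |url=http://example.org/live |title=Live scores}}"
page = "Text.<ref>" + cite + "</ref>"
print(append_cbignore(page, cite))
```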
That is very useful information. Thank you. AlH42 (talk) 12:41, 23 May 2024 (UTC)Reply
You're welcome. :-)—CYBERPOWER (Chat) 21:27, 29 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:27, 29 May 2024 (UTC)Reply

PHP Fatal error

I have set up the bot myself, but I get this error:

PHP message: PHP Fatal error: Uncaught TypeError: mysqli_real_connect(): Argument #6 ($port) must be of type ?int, string given in /app/src/Core/DB.php:275 Justman10000 (talk) 23:27, 4 May 2024 (UTC)Reply

Which page exactly is this occurring on? —CYBERPOWER (Chat) 15:02, 15 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:27, 29 May 2024 (UTC)Reply

Archive.ph → Archive.today

Tracked in Phabricator:
Task T361746

https://nl.wikipedia.org/w/index.php?title=Patreon&diff=next&oldid=66920330

and

https://nl.wikipedia.org/w/index.php?title=Prog_(tijdschrift)&diff=next&oldid=66920158

But archive.ph is the same service and the link with ph works fine. This is again a clear violation of the Dutch version of “if it ain't broke, don't fix it” guideline, just like the most recent time we spoke. Mondo (talk) 20:11, 1 February 2024 (UTC)Reply

Mondo, bug report has been filed. Harej (talk) 20:29, 3 April 2024 (UTC)Reply
Thank you. 🙂 Mondo (talk) 20:38, 3 April 2024 (UTC)Reply
I replied in the Phab task with the technical reason: the change is made for functional reasons, not cosmetic ones. archive.today is a special domain that is functionally more reliable than the others, and it is also the domain the owners of archive.today asked us to use on Wikipedia as a safeguard against potential future outages. -- GreenC (talk) 14:42, 4 April 2024 (UTC)Reply
They can request whatever they want, but at least on the Dutch Wikipedia, changes at the request of owners are seen as an unwanted change and even without their request it's seen as an unwanted change, so something still needs to be done about it. Mondo (talk) 14:57, 4 April 2024 (UTC)Reply
Besides, it looks like the bot doesn't even care for archive.today that much anyway, as it just changed an archive.today URL to archive.is: https://nl.wikipedia.org/w/index.php?diff=prev&oldid=67337586 (the second highlighted reference). I used IABot for this. Mondo (talk) 19:56, 7 April 2024 (UTC)Reply
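
Note: the domain rewrite being discussed here can be sketched as below. This is a minimal Python illustration, not IABot's actual code. The alias set contains only the two domains named in this thread (archive.ph and archive.is), treating them as interchangeable is an assumption taken from the comments above, and the function name and sample path are hypothetical.

```python
from urllib.parse import urlsplit

# Only the alias domains mentioned in this thread; not a complete list.
ARCHIVE_TODAY_ALIASES = {"archive.ph", "archive.is"}


def normalize_archive_today(url: str) -> str:
    """Rewrite known alias hosts to archive.today, keeping path and query.

    Simplification: any port or userinfo in the original host part is dropped.
    """
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    if host in ARCHIVE_TODAY_ALIASES:
        return parts._replace(netloc="archive.today").geturl()
    return url


print(normalize_archive_today("https://archive.ph/AbCdE"))
```

A rewrite like this should be consistent in one direction; the diff linked above, where archive.today became archive.is, would be the opposite of what this sketch does.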

I am disabling the bot indefinitely on Dutch Wikipedia until this is addressed. Harej (talk) 18:41, 10 May 2024 (UTC)Reply

Thank you for taking action, I really appreciate that. 🙂 Mondo (talk) 19:14, 10 May 2024 (UTC)Reply

Invitations to translate

Hi! I found two translatable messages that say "Sorry, but the language you have picked is not available yet." They invite the user to translate the interface at translatewiki.

Can they ever be shown in other languages? It looks like by their nature, they can only be shown in English, but maybe I'm misunderstanding something. Amir E. Aharoni (talk) 16:37, 11 May 2024 (UTC)Reply

@Amire80:, no. They are hard-coded in English as there is no point in translating them. They will never be shown if the UI has a complete translation, and it will be irrelevant to those who can't understand English anyway as that is the base the translations come from. —CYBERPOWER (Chat) 15:04, 15 May 2024 (UTC)Reply
Great, thanks for the response. Since they can never be shown in their translated form, I'll remove them from the translation workflow. This is only a matter of configuration on translatewiki.net, and no action is needed in IABot code. Amir E. Aharoni (talk) 17:14, 15 May 2024 (UTC)Reply
... I explored it a bit more, and I realized that languageunavailableheader and languageunavailablemessage are always shown only in English, but "incompletetranslationheader" and "incompletetranslationmessage" can be shown when the localization is incomplete, as their names imply. I'll do the configuration accordingly. Amir E. Aharoni (talk) 17:31, 15 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:28, 29 May 2024 (UTC)Reply

Links to nothing

Hello. Apart from the witty "verify the verifiability", which I imagine is a not very accurate translation, what is the point of linking on Internet Archive to digitized books of which, because they are under copyright, nothing can be viewed except the front and back covers? Is it to make it possible to check that the book exists? And why does that need checking? Because we don't trust the Wikipedian who entered the bibliographic record, ISBN included? Isn't the ISBN enough for that check? And since we don't trust the Wikipedian, we waste the reader's time by inviting them to click a link that will not let them verify anything beyond what the ISBN already tells them. Is that the idea? Regards, --Enrique Cordero (talk) 09:53, 12 May 2024 (UTC)Reply

Which book(s)? Normally with books at archive.org you can "search inside", view pages within the book - it is the same reason people add links to Google Books. It's useful for looking up information from the citation. -- GreenC (talk) 16:00, 15 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:28, 29 May 2024 (UTC)Reply

"locale" parameter

The translatable message permissionschange says:

Your {{locale}} user permissions have been changed by <a href="{{actionuserlink}}">{{actionuser}}</a>.

What is the {{locale}} parameter?

Also, the qqq documentation says "Email body with HTML and templates", but this is a bit generic. What is the email about exactly? I guess that it's about some permissions that were changed, but where and in what context? Amir E. Aharoni (talk) 20:30, 12 May 2024 (UTC)Reply

locale describes the wiki. For example, if on the tool interface I gave you root, but only for enwiki, locale would be enwiki-English Wikipedia, or whatever the local translation of the Wikipedia name is. Does this help? —CYBERPOWER (Chat) 15:10, 15 May 2024 (UTC)Reply
OK, just to verify: Will the actual string be "enwiki-English Wikipedia"? Or "English Wikipedia"? Or just "enwiki"? Amir E. Aharoni (talk) 17:10, 15 May 2024 (UTC)Reply
I believe it's the first one. —CYBERPOWER (Chat) 18:59, 22 May 2024 (UTC)Reply
Thanks :) Amir E. Aharoni (talk) 22:45, 22 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:28, 29 May 2024 (UTC)Reply

Link removal in it.wiki

I noticed that in this edit on Comunità delle Regole di Spinale e Manez the bot removed the non-working link and substituted it with {{Collegamento interrotto}}. -- ZandDev (talk) 16:16, 15 May 2024 (UTC)Reply

Sorry, but this is an old edit. Many changes and improvements have been made to IABot since then. If this keeps happening, please re-report with a new example. —CYBERPOWER (Chat) 19:01, 22 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:29, 29 May 2024 (UTC)Reply

Parameters on be.wiki

Hello! The Cite templates on Belarusian Wikipedia have been updated and now support the |archive-date, |archive-url and |url-status parameters, like English Wikipedia. Please correct the bot's behaviour for bewiki: add |archive-date=YYYY-MM-DD |archive-url=... |url-status=dead instead of |archivedate=YYYY-MM-DD |archiveurl=... |deadurl=yes for all Cite templates (be:Template:Cite web, be:Template:Cite book, be:Template:Cite journal etc.) and localized versions: be:Template:Кніга, be:Template:Артыкул, be:Template:Публікацыя, be:Template:Навіна and be:Template:Спасылка.--Artsiom91 (talk) 07:30, 17 May 2024 (UTC)Reply
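
Note: the parameter change requested above can be summarised with a minimal Python sketch. The function name modernize_params and the sample values are hypothetical, and this is not how IABot is configured; it only restates the old-to-new mapping from the request. Only deadurl=yes is mapped, because that is the only value the request specifies; any other deadurl value is left untouched.

```python
# Old parameter name -> new parameter name, as listed in the request above.
RENAMES = {"archivedate": "archive-date", "archiveurl": "archive-url"}


def modernize_params(params: dict[str, str]) -> dict[str, str]:
    """Return a copy of params using the newer citation parameter names."""
    updated: dict[str, str] = {}
    for name, value in params.items():
        if name == "deadurl" and value.strip().lower() == "yes":
            # The request only specifies deadurl=yes -> url-status=dead.
            updated["url-status"] = "dead"
        else:
            updated[RENAMES.get(name, name)] = value
    return updated


# Sample values invented for this example.
old = {
    "url": "http://example.org",
    "archiveurl": "https://web.archive.org/web/2024/http://example.org",
    "archivedate": "2024-05-17",
    "deadurl": "yes",
}
print(modernize_params(old))
```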

Hi. I see your wiki has imported the CS1 Citation modules from enwiki. You can actually change your cite templates to use that and localize it very easily. However, this is not required. IABot now reads that Cite template and can use that knowledge to adapt its behavior to non-CS1 templates. It should behave correctly for English language templates. For the other templates, I recommend adding the local aliases to the Citation/CS1/Configuration module for the localized variants, and having them point to CS1 as well. This will make cross-wiki adaptations much easier. —CYBERPOWER (Chat) 20:42, 22 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:29, 29 May 2024 (UTC)Reply

Enable logging on an external tool

Hi! In the translatable message "enableAPILogging", what does "logging on" mean?

"Writing to a log"?

Or "Logging into an account"? Amir E. Aharoni (talk) 06:07, 19 May 2024 (UTC)Reply

Writing to a log. :-) —CYBERPOWER (Chat) 20:42, 22 May 2024 (UTC)Reply
Thanks :) Amir E. Aharoni (talk) 22:46, 22 May 2024 (UTC)Reply
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. —CYBERPOWER (Chat) 21:29, 29 May 2024 (UTC)Reply

Can't archive

Hi IAB admins, I'm having an issue. With one of my articles, en:Aston Martin Rapide, I've been trying to archive the sources, but it comes up with this. No links were analyzed for some reason. Any idea why? 750h+ (talk) 05:08, 25 May 2024 (UTC)Reply

I can't see what you are referring to. The image is not showing for me.—CYBERPOWER (Chat) 21:30, 29 May 2024 (UTC)Reply

The bot wrongly claims links are dead

Hi! The bot keeps claiming that links in the following article are dead, but they are ok, as far as I see. https://el.wikipedia.org/wiki/%CE%95%CE%B8%CE%BD%CE%B9%CE%BA%CE%AE_%CE%95%CE%BB%CE%BB%CE%AC%CE%B4%CE%B1%CF%82_(%CE%A6%CE%B5%CE%BD%CF%84_%CE%9A%CE%B1%CF%80) Can you do something, please? Thank you. --Harry Deconstructing (talk) 12:11, 29 May 2024 (UTC)Reply

Can you please provide more concrete examples?—CYBERPOWER (Chat) 21:32, 29 May 2024 (UTC)Reply
If you check the references, most of the links are deemed dead. However they work. For example
Reference No 25: Greece - Mexico 1 - 2 (1983)[νεκρός σύνδεσμος]
The link is https://www.billiejeankingcup.com/en/draws-and-results/W-FC-1983-WG-M-GRE-MEX-01?matchId=itf_2610164d79ebc202150c3ed3669cb0b6 Harry Deconstructing (talk) 00:32, 30 May 2024 (UTC)Reply

Category:CS1 maint: url-status at EN Wikipedia

Hello, I was wondering if InternetArchiveBot could go through CS1_maint:_url-status on EN Wikipedia. I checked some of the articles on the list. Articles like Alan Barinholtz and Football at the 2024 Summer Olympics have a working URL and don't need |url-status=live. So far, I haven't seen a |url-status=dead parameter that's missing an archived URL and archive date. There are over 2,000 articles to check. Thanks! MrLinkinPark333 (talk) 23:33, 30 May 2024 (UTC)Reply

False positive dead link

Hi, https://geoportal.rsd.cz (see https://cs.wikipedia.org/w/index.php?title=D%C3%A1lnice_D1&diff=prev&oldid=23963893) is not dead; maybe it is just geo-restricted. --Harold (talk) 15:53, 31 May 2024 (UTC)Reply