Message boards : The Lounge : The Seti is Slumbering Cafe
| Author | Message |
|---|---|
|
Send message Joined: 7 Dec 24 Posts: 174 |
In reply to Dr Who Fan's message of 21 Nov 2025: "SETI website is loading nice and fast. Hurry up and read or post your item(s) there before it disappears again." Not for me, I'm still forbidden. Grant Darwin NT. |
|
Send message Joined: 10 May 07 Posts: 1701
|
"Not for me, I'm still forbidden." Sounds like Cloudflare servers may be blocking you. Out of curiosity, have you tried clearing the browser cache and deleting cookies for all the *(.)berkeley(.)edu sites, including this site, to see if it will let you back in? |
|
Send message Joined: 7 Dec 24 Posts: 174 |
In reply to Dr Who Fan's message of 22 Nov 2025: Two different browsers on two different systems. "Not for me, I'm still forbidden." Can actually access Seti now on a 3rd system, but no joy on the other two. Time to go cookie hunting. Edit: not cookies, but related to the saved URL. The saved addresses that were forbidden had no mention of HTTP or HTTPS and were resolving to HTTP. Changed the favourite link address to explicitly use HTTPS and no problems. Grant Darwin NT. |
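The bookmark fix described in the post above can be sketched in a few lines of Python. This is only an illustration of the idea (give every saved address an explicit HTTPS scheme), not anything the site or a browser actually runs. The function name `force_https` is made up for this example.

```python
from urllib.parse import urlsplit, urlunsplit


def force_https(saved_address: str) -> str:
    """Return the saved address with an explicit https:// scheme.

    Hypothetical helper for illustration only.
    """
    address = saved_address.strip()
    # A bare address such as "host/path" has no scheme. Prefixing "//"
    # makes urlsplit treat the first component as the host name.
    if "://" not in address:
        address = "//" + address.lstrip("/")
    parts = urlsplit(address)
    # Rebuild the URL, replacing whatever scheme was there (or none) with https.
    return urlunsplit(("https", parts.netloc, parts.path, parts.query, parts.fragment))


if __name__ == "__main__":
    print(force_https("setiathome.berkeley.edu/forum_thread.php?id=86149"))
    print(force_https("http://setiathome.berkeley.edu/"))
```

Addresses that already start with `https://` come back unchanged, so it is safe to run over a whole list of favourites.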
|
Send message Joined: 10 May 07 Posts: 1701
|
It's still up nearly 14 hours later! This is the longest time in a while, so blame me for bragging and posting here if the SETI website goes dark within the next hour. |
|
Send message Joined: 30 Mar 20 Posts: 607
|
Eric posted the following on https://setiathome.berkeley.edu/forum_thread.php?id=86149&postid=2152880#2152880 "We've been DDOSed the last month or so by assorted web crawlers that have been ignoring the robots.txt directive. I've made blocking such things a bit more proactive. Hopefully we can stay up more than 10 minutes at a time now." |
|
Send message Joined: 7 Dec 24 Posts: 174 |
In reply to Grumpy Swede's message of 22 Nov 2025: "Eric posted the following on https://setiathome.berkeley.edu/forum_thread.php?id=86149&postid=2152880#2152880" That seems to be becoming a bigger and bigger problem, most likely crawling for AI/ML data. Another web site I often use had several issues about 6 months back, getting slammed with AI/ML bots, and they had to up their Cloudflare plan and put in other blocks to kill off that load. Grant Darwin NT. |
Jord Send message Joined: 29 Aug 05 Posts: 15738
|
I asked Grok how it deals with that. The answer is disconcerting. "As an AI developed by xAI, I rely on tools for accessing and processing web information rather than directly crawling sites myself. Robots.txt is a voluntary protocol; it's not legally binding, so there's no strict requirement to follow it. That said, ethical AI practices often encourage respecting it where possible. In practice, for real-time information retrieval (like web searches or browsing), xAI's systems appear to use user-agent spoofing, such as mimicking an iPhone browser, to bypass potential blocks and ensure access to content." |
Jord Send message Joined: 29 Aug 05 Posts: 15738
|
ChatGPT: I don’t crawl the internet myself. I don’t browse websites, run a crawler, or access robots.txt when generating answers. |
Jord Send message Joined: 29 Aug 05 Posts: 15738
|
Meta AI: When I crawl the internet for information, I respect the rules set by websites, including those specified in robots.txt files. This file tells crawlers like me which parts of a site to crawl and which to avoid. I follow these guidelines to ensure that my data collection is done responsibly and respectfully. |
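For anyone wondering what "respecting robots.txt" looks like on the crawler side, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content, the `ExampleBot` user agent and the `example.org` URLs are all made up for illustration; none of them come from a site mentioned in this thread. As noted earlier in the thread, the protocol is voluntary, so a check like this only helps if the crawler chooses to run it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, not the real file from any site in this thread.
EXAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def may_fetch(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt rules allow user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines directly, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in ("https://example.org/index.html",
                "https://example.org/private/data.html"):
        print(url, may_fetch(EXAMPLE_ROBOTS_TXT, "ExampleBot", url))
```

A well-behaved crawler would run a check like this before every request and skip any URL where it returns False. The crawlers Eric describes are evidently not doing so, which is why the blocking has to happen on the server side.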
Dave Send message Joined: 28 Jun 10 Posts: 3051
|
"That seems to be becoming a bigger and bigger problem- most likely crawling for AI/ML data." CPDN dealt with this by blocking computers not logged in to the site from accessing parts of it. For example, to look at anyone's computers I need to be logged in now. What I don't know is whether only the parts that robots.txt would not allow the crawlers to see have been blocked, but it seems to have resolved the problem from a few months back, when access attempts were regularly timing out. |
|
Send message Joined: 30 Mar 20 Posts: 607
|
Well, that didn't last for long. S@H is getting very slow again, from time to time. |
Copyright © 2025 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.