Hey Brad,So y'all can relax.
EvanM passed a note our way from LJ Tech Support, regarding Blog Search and its accidental indexing of "noindex" LJ content. Just wondering if you guys could let your users know that this was entirely unintentional, and a fix should go live within the next day or two? (hopefully tomorrow)
Thanks,
E
(I'm also talking to them about RSS/Atom specs for indicating noindex so they don't have to hit up HTML to learn about it.)
And please, people, stop spreading paranoia: they're not using RSS as a "workaround" to not obey robots.txt and noindex... that's just silly on so many levels.
Remember the golden rule on the Internets:
Never attribute to malice what can be adequately explained by stupidity.... or in this case, an accident.
September 16 2005, 04:20:29 UTC 6 years ago
September 16 2005, 04:20:54 UTC 6 years ago
Not that I complained in the first place. I welcome all robots to my journal. =)
September 16 2005, 08:22:53 UTC 6 years ago
I amz az robotz. Myz Prime Directivez are to Indexz your LivezJournalz.6 years ago
6 years ago
6 years ago
September 16 2005, 04:26:16 UTC 6 years ago
September 16 2005, 04:46:38 UTC 6 years ago
September 16 2005, 04:53:02 UTC 6 years ago
neat
It's nice to know that a press conference and involvement from grass roots organizations was not required.September 16 2005, 05:56:37 UTC 6 years ago
September 16 2005, 08:47:02 UTC 6 years ago
6 years ago
6 years ago
September 16 2005, 06:15:35 UTC 6 years ago
This seems like a good opportunity to solve this properly with HTTP headers:
Aside from the obvious benefit that it can then apply to any media type including images, having it “out of band” means that the code to handle it can be centralised to
LJ::make_journalrather than duplicating it in S1, S2 and talkread.bml. Still needs to go in a few awkward BML pages, but it's still a win. (Of course, the old robots blocking will no dout have to stay where it is for the benefit of those mythical “other search engines” I've heard about.)If you just come up with some half-baked solution specific to RSS and Atom we'll be doing this dance again soon enough. For the people who are using stunted webservers and can't set such things, the problem for the Atom/RSS folks then becomes a way to do
http-equivlike HTML does, allowing these header fields to be embedded into the document. That doesn't have to be LJ's problem, though.September 16 2005, 06:33:29 UTC 6 years ago
My other thought of the day is the need for something more specific than whole document inclusion/exclusion, in light of aggregations like
atom-stream.xml. I like the idea of an XML attribute. For example:<feed xmlns='http://www.w3.org/2005/Atom' r:index="no" xmlns:r="http://namespace/robot/">6 years ago
6 years ago
September 16 2005, 07:07:50 UTC 6 years ago
September 16 2005, 07:18:14 UTC 6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
September 16 2005, 12:23:22 UTC 6 years ago
September 16 2005, 14:38:00 UTC 6 years ago
September 16 2005, 18:16:04 UTC 6 years ago
September 16 2005, 20:40:54 UTC 6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
September 16 2005, 22:04:27 UTC 6 years ago
If someone doesn't want their entry to be accessible, it should not be public. Period.
September 16 2005, 22:20:41 UTC 6 years ago
Your whole "point of the internet" emotive argument is stupid anyway. The internet serves many purposes and one of them is not to have 100% of information accessible via search engines. Do you want your medical records accessible via search engines? No? What's wrong? I thought the point of the internet was to have everything accessible via a search engine.
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
6 years ago
September 17 2005, 04:03:35 UTC 6 years ago
any idea on when this will change and why it happened?
September 17 2005, 05:49:08 UTC 6 years ago
September 17 2005, 23:53:33 UTC 6 years ago
I know I'm an ignoramus about these things, but I'm still shocked that our journal cannot be protected from the unscrupulosity or incompetence of the people who run the search engines. I had assumed our privacy was fully protected by Live Journal which I trust.
September 19 2005, 19:55:48 UTC 6 years ago
It also indexes on the Journal Title
My journal has the Title of Unquiet Ether.Do a search on that, and I'm turning up all over the place, although Hilltop isn't.
September 19 2005, 22:13:29 UTC 6 years ago
Time to ditch Live Journal? I and a bunch of my friends are all feeling the same, I think this really needs to be adressed in a more serious fashion and not fobbed off as 'hysterical users'.
When friends only posts still show on the search there is an issue with your security code, no?
September 22 2005, 21:33:26 UTC 6 years ago
September 24 2005, 06:41:21 UTC 6 years ago
If you want to suggest that this feature be added to LiveJournal, you can offer it up at
September 29 2005, 02:14:22 UTC 6 years ago