Yes, with a major caveat. An instance will search only communities that at least one user on the instance is subscribed to and only as far back as the time the first user on the instance subscribed to the community.
Lemmynet’s design structure has some weird choices in it, motivated either by laziness or to keep garage servers from being overwhelmed, and that’s the biggest and weirdest one. I’d like to see federation = full and complete synchonization from server launch to present, but I doubt the motivation is there to implement it. Maybe things will be different when kbin eventually surpasses Lemmy
Interesting, can you explain a little more? I’m very curious.
Yes, but it’s not as easy as appending “lemmy”. Here, the content is spread over a wide number of URLs, and unlike reddit, they don’t rank super high in the results.
But thanks to federation, search engines should eventually pick up most content, even when concentrating on just a few big instances. I’m guessing that eventually, you’ll be able to do a search with
site:lemmy.ml
and get most results (as with that many users, chances are high any other post has been copied via federation). Until then, you might need a more widespread search, I created a lens for kagi (paid search engine), that searches the biggest instances: kbin.social, lemmy.ml, lemmy.world, beehaw.org, sh.itjust.works, lemm.ee, programming.devThe result is this:
That does not work with
site:lemmy.ml
and would instead requiresite:lemmy.world
on google.Caveat: Duplicate content detection might really throw things off. Normally, if two sites have the same content, but different URLs and no
canonical
html reference, then one will be detected as spam (or should be), I’m not sure how search engine indexing will be affected by this in the future.Thank you for this. I also use Kagi so im totally gonna set the lens up.
I checked, the
!lemmy
bang is not in use ;)What’s Kagi?
It’s a subscription-based search engine. From what I’ve seen, the results seem pretty good compared to everything else at this point, but IDK if I’d want to pay monthly for search (though honestly I’m seriously considering it at this point).
As someone who’s been a paid user for a few months now, it is totally worth it. It has its moments where I need to try another engine, but the vast majority of the time it’s way better. The ability to rank sites is a game changer, and as a bonus I can block Pinterest and the usual fluff from results and never see them again.
I bet it still respects search modifiers? Both DDG and Google frequently ignore mine.
That’s interesting that lemmy doesn’t generate canonicals. I would have thought that the original instance something is posted on would set the canonical, and other instances can point back to that - it really seems like this sort of problem is exactly what canonicals are made for. Does anyone know if there’s a reason for not using them (other than dev time, which is 100% a good reason)?
Oh snap, + 1 for Kagi usage, I’ll try your implementation
I’d say the short answer is no and the long answer is yes.
Searching across instances is difficult, for the reason RandomBit mentioned in this thread. But you don’t go to reddit and expect your search results to include Hacker News, Twitter, etc. When you search on reddit, it also only searches the local instance, it just is that there is only one instance. So the search is exactly the same as reddit.
With that said, there is probably room for a service that provides cross-instance search by subscribing and indexing communities like a crawler, rather than relying on users to create the federation.
FediSearch I guess is similar to your idea, though I think the goal would be to make a new and open search index specifically containing fediverse websites instead of just using Google. I also feel like the formatting should be more like Lemmy, with the particular post title and short description showing instead of the generic search UI.
The idea of a fediverse search is really cool though. If things like news and academic papers ever got their own fediverse-connected service, I could see a FediSearch being a great alternative to the AI sludge of Google.
Not yet, I don’t think
I think the different domains and levels of federation makes it less convenient. Personally, I think the Fediverse should collaborate on a shared wiki for community knowledge and whatnot.
Any web search seaches Lemmy as well. Just seach for your user ID or display name to find your own content.
I guess what I want to learn is that if I were to type into Google “what are the best iPhone games Reddit” I would get a bunch of Reddit threads does the same thing work if I were to end the question and “lemmy”
That would be the question. You can certainly do a site search. At least duckduckgo can do that. It may not be so easy to just search lemmy, the threadiverse, or the fediverse for example. Do not know.
Probably if the instances have lemmy in their domains like mine, or lemmy.ml, lemmy.ca, or in their name (mine is “pe1uca’s lemmy” so if the domain was different maybe search engines could also work with that), but not for ones like, lemmit.online, sh.itjust.works which don’t have lemmy neither in their names nor domain.
It’d be like searching for content posted in sites which use wordpress.Give it a try and let us know.
Maybe using brave search or something?
Whoever is in charge of discuss.tchncs.de has converted the community search into a lemmy content search
oh, so that isn’t how it is supposed to be? I was wondering how to search for communities, this is the only Lemmy instance I have ever used, maybe I should pick a different one?