I’m trying to set up a personal Lemmy instance, and I’ve got it running but it doesn’t seem to sync very well with posts and comments made before the instance was created. I ran [lemmony] (https://github.com/jheidecker/lemmony) to get the /all to work correctly and to start syncing communities, but now when I go to some communities and I look at the posts created before I subscribed to the community, they either don’t show up or don’t have the correct number of upvotes/comments. Also, when I search for communities, next to the community name is only the number of users from my instance subscribed, not the actual number of subscribers to the community. Is there a way to fix this?
I understand the goal of the tool, the defaults are a really bad approach at achieving it and the docs are really bad at identifying the pitfalls. A tool that subscribes to a list of communities provided in a text file would be great. Subscribing to the entire lemmyverse is a solution that creates problems that are worse than the discovery problem.
All of this content seems fairly clearly to me to fall into the category “content that can cause legal liability for the hoster depending on their jurisdiction”. Is that a controversial point of debate?
This all sounds eminently reasonable. 800 subs is a lot, but it’s much more reasonable than the 7k subs this tool leaves you with in it’s default config, and if you further curate it manually and that’s what it takes for your feed to feel lively… then go for it.
Maybe consider releasing it? I totally agree that community discovery is rough all over, and moreso on tiny instances. A tool to help folks bootstrap 50-200 communities and that did a good job documenting the tradeoffs of oversubscription and helped folks identify/avoid legal risk would be a huge step up from the “subscribe all” approach.
Content is NOT served from the original instance. https://lemmy.world/post/1191149 shows a post that was made to a community on
lemmy.ml
. Because there are subscribers onlemmy.world
, that post is replicated there. Any unauthenticated user on the internet can view that post, the content was pulled out of the db onlemmy.world
and sent out fromlemmy.world
’s ip and over its internet connection. By every legal definition I’ve encounteredlemmy.world
is serving that post and subject to any legal complications that entails. The only exception I’m aware of are full-size images, which don’t replicate. Thumbnails do though, so that provides no protection. You host the image content, just at reduced quality.This is also not true. In the US, you have to register a copyright agent to receive the kinds of protection typically associated with commercial hosts. If you fail to do so, I believe that you run the risk of just getting sued out of the gate for copyright issues. There are also almost certainly jurisdictions where hosting gay porn or certain political speech is a “straight to jail” kind of maneuver.
Of course, I have no evidence that OP is in a particularly dangerous jurisdiction. But my broader point is that new users of single-user instance often don’t consider that they may be signing up to host legally risky content that they themselves didn’t create, view, or want. If one curates their list of subs, they can gauge for themselves what communities they consider to be risky. If they “subscribe all”, they WILL be serving to the unauthenticated public internet the worst of the lemmyverse without realizing it… which is an entirely avoidable situation.
They offered an option to limit the sub count, but the default is still unlimited. They seem aggressively against more sensible defaults in other posts.
It’s also worth noting that there’s an upper limit on the number of communities you choose to federate with, while there doesn’t seem to be an upper limit on the blocked communities