this post was submitted on 01 Aug 2023
94 points (97.0% liked)

Ask Lemmy

27073 readers
2153 users here now

A Fediverse community for open-ended, thought provoking questions

Please don't post about US Politics. If you need to do this, try [email protected]


Rules: (interactive)


1) Be nice and; have funDoxxing, trolling, sealioning, racism, and toxicity are not welcomed in AskLemmy. Remember what your mother said: if you can't say something nice, don't say anything at all. In addition, the site-wide Lemmy.world terms of service also apply here. Please familiarize yourself with them


2) All posts must end with a '?'This is sort of like Jeopardy. Please phrase all post titles in the form of a proper question ending with ?


3) No spamPlease do not flood the community with nonsense. Actual suspected spammers will be banned on site. No astroturfing.


4) NSFW is okay, within reasonJust remember to tag posts with either a content warning or a [NSFW] tag. Overtly sexual posts are not allowed, please direct them to either [email protected] or [email protected]. NSFW comments should be restricted to posts tagged [NSFW].


5) This is not a support community.
It is not a place for 'how do I?', type questions. If you have any questions regarding the site itself or would like to report a community, please direct them to Lemmy.world Support or email [email protected]. For other questions check our partnered communities list, or use the search function.


Reminder: The terms of service apply here too.

Partnered Communities:

Tech Support

No Stupid Questions

You Should Know

Reddit

Jokes

Ask Ouija


Logo design credit goes to: tubbadu


founded 1 year ago
MODERATORS
 

Regardless of the kind of news. I'm working on a TLDR bot and I'd like it to support the most used sites on Lemmy.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 2 points 1 year ago (2 children)

Oh I see, no, that's sadly not possible to do universally, I have to evaluate the structure of each site to find the text content and archive basically copies the structure of the target website, meaning there's no single structure for achive.org.

[–] [email protected] 1 points 1 year ago

That’s what I was afraid of. I was hoping the site itself would format the content in a way to make it possible. Thanks for the effort!

[–] [email protected] 1 points 1 year ago (1 children)

Would it be possible to use the structure for the NYTimes on archive.org, as a way of bypassing the account and JavaScript requirements? Or am I misunderstanding how that would work

[–] [email protected] 2 points 1 year ago

I think it would be possible, yes.