this post was submitted on 13 Nov 2023

Is there an existing plaintext file format for storing discussion forum threads like this one, or like those on ubuntuforums? I want to archive discussions I like locally. I have been using SingleFileZ to download the whole page to my machine, but I prefer plaintext formats. When I tried Org-web-tools, for example, it did not seem to extract Reddit discussion pages properly.

I suppose I could write a scraper and dump the content as JSON, but I'd prefer a plaintext format like org-mode that was designed with some thought put into it, rather than cobbling something together myself.

top 4 comments
[–] [email protected] 1 points 11 months ago

There is nnreddit.

There is also the RSS feed (add ".rss" to a subreddit's URL), but that only contains the posts, not the comments.

[–] [email protected] 1 points 11 months ago

mbox would be perfect. You can use Gnus or rmail to view them.
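
A minimal sketch of the mbox idea, using Python's standard `mailbox` module. It assumes each forum post has already been scraped into a dict; the field names (`post_id`, `author`, `subject`, `body`, `parent_id`) and the `forum.invalid` domain are made up for illustration. Replies get an `In-Reply-To` header so a mail reader can thread them.

```python
import mailbox
from email.message import EmailMessage

DOMAIN = "forum.invalid"  # placeholder domain, not a real address


def post_to_message(post_id, author, subject, body, parent_id=None):
    """Build one email message from one forum post (hypothetical fields)."""
    msg = EmailMessage()
    msg["From"] = f"{author}@{DOMAIN}"
    msg["Subject"] = subject
    msg["Message-ID"] = f"<{post_id}@{DOMAIN}>"
    if parent_id is not None:
        # Point at the parent post so mail readers can rebuild the thread.
        msg["In-Reply-To"] = f"<{parent_id}@{DOMAIN}>"
        msg["References"] = f"<{parent_id}@{DOMAIN}>"
    msg.set_content(body)
    return msg


def write_thread(path, posts):
    """Append every post in `posts` (a list of dicts) to the mbox at `path`."""
    box = mailbox.mbox(path)  # created if it does not exist yet
    try:
        for post in posts:
            box.add(post_to_message(**post))
        box.flush()
    finally:
        box.close()
```

Calling `write_thread("thread.mbox", posts)` should then give a file that Gnus or rmail can open; how well the threading displays depends on the reader.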

[–] [email protected] 1 points 11 months ago

There are some packages that you can utilize:

They both use org-mode format to display discussions.

[–] [email protected] 1 points 11 months ago

> I suppose I can write a scraper and dump content in json format.

No need: Reddit already provides its data as JSON. Generally, just append .json to the end of the URL and you get the JSON. For example:

https://www.reddit.com/r/emacs/comments/17u00j0/extracting_forums_posts_like_reddit_discussions/

->

https://www.reddit.com/r/emacs/comments/17u00j0/extracting_forums_posts_like_reddit_discussions.json
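
A rough sketch of how that JSON could be turned into an org-mode outline, using only the Python standard library. It assumes the thread JSON is a two-element list (the post listing, then the comment listing), that comments have kind `t1` with `author`, `body`, and `replies` fields, and that `replies` is an empty string when there are none. The function names and the User-Agent string are made up here.

```python
import json
import urllib.request


def json_url(thread_url: str) -> str:
    """Drop any trailing slash and append .json, as described above."""
    return thread_url.rstrip("/") + ".json"


def comments_to_org(listing: dict, depth: int = 2) -> list[str]:
    """Walk a comment listing recursively, one org heading per comment."""
    lines = []
    for child in listing["data"]["children"]:
        if child["kind"] != "t1":  # skip "load more" placeholders
            continue
        comment = child["data"]
        lines.append(f"{'*' * depth} {comment['author']}")
        lines.append(comment["body"])
        replies = comment.get("replies")
        if isinstance(replies, dict):  # assumed to be "" when empty
            lines.extend(comments_to_org(replies, depth + 1))
    return lines


def thread_to_org(thread: list) -> str:
    """Convert the parsed thread JSON into org-mode text."""
    post = thread[0]["data"]["children"][0]["data"]
    lines = [f"* {post['title']}", post.get("selftext", "")]
    lines.extend(comments_to_org(thread[1]))
    return "\n".join(lines) + "\n"


def fetch_thread(thread_url: str) -> list:
    """Download and parse the JSON for one thread (needs network access)."""
    request = urllib.request.Request(
        json_url(thread_url),
        headers={"User-Agent": "forum-archiver-sketch/0.1"},  # made-up UA
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Something like `print(thread_to_org(fetch_thread(url)))` would then dump the thread as nested headings. Comment bodies are copied verbatim, so a body line that starts with `*` would need escaping before the result is valid org.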