I recently started up FreshRSS in my docker environment. I was super excited about the web scraping feature.

Now that I’m setting it up, it looks like that it is able to scrape single web pages, but I am unable to figure out how to get it to crawl into the actual article to scrape the full content.

Is anyone aware of how to do this. For example, runescape.com/m=news/ This page has a list of articles with a thumbnail, title, category, date, and a short description of the article. Would it be possible for FreshRSS to crawl into the article link and scrape the contents within?

  • EliteCow@lemmy.dbzer0.comOP
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    Thank you again :). From your explanation, I think I have a good grasp on how to identify the proper CSS elements now.

    Have a wonderful day!

    • gnzl@nc.gnzl.cl
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      No problem! FreshRSS really is amazing so I’m happy to help and spread the love.