Web Crawling gems

Showing projects tagged as Web Crawling

  • Mechanize

    8.9 7.0 L4 Ruby
    Mechanize is a Ruby library that makes automated web interaction easy.
  • anemone

    7.6 0.0 L5 Ruby
    Anemone web-spider framework
  • Upton

    7.2 0.0 L5 HTML
    A batteries-included framework for easy web-scraping. Just add CSS! (Or do more.)
  • Wombat

    6.5 0.5 L5 Ruby
    Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
  • FastImage

    6.4 3.6 L5 Ruby
    FastImage finds the size or type of an image given its URI by fetching as little as needed.
  • MetaInspector

    6.1 2.4 L5 Ruby
    Ruby gem for web scraping. It scrapes a given URL and returns its title, meta description, meta keywords, links, images...
  • pismo

    5.6 0.0 L5 Ruby
    Extracts machine-readable metadata and content from Web pages
  • LinkThumbnailer

    4.8 1.3 L5 Ruby
    Ruby gem that fetches images and metadata from a given URL, much like the link previews on popular social websites.
  • Vessel

    3.4 5.4 Ruby
    Fast high-level web crawling Ruby framework
  • Screencap

    3.2 0.0 JavaScript
    A gem to screencap webpages in Ruby. Uses PhantomJS under the hood.
  • instabot.rb

    2.6 0.0 Ruby
    An Instagram bot that works without the Instagram API and needs only your username and password. Written in Ruby.
  • The Hawker Ruby gem

    1.7 0.0 Ruby
    The Hawker gem is a web scraper that pulls the basic information for a given social media profile URL.
  • html2rss

    1.4 5.6 Ruby
    📰 Build RSS 2.0 feeds from websites (and JSON APIs) with a few CSS selectors.
  • Supplejack API

    1.3 8.1 Ruby
    Supplejack API Mountable Engine
  • Google Search Results in Ruby

    1.1 4.9 Ruby
    Google Search Results via SERP API Ruby Gem
  • Kimurai

    0.6 2.1 Ruby
    Kimurai is a modern web scraping framework written in Ruby that works out of the box with Headless Chromium/Firefox, PhantomJS, or simple HTTP requests, and lets you scrape and interact with JavaScript-rendered websites.