libextract

на сайте с December 12, 2022 09:19

Libextract: extract data from websites. Libextract is a statistics-enabled data extraction library that works on HTML and XML documents and written in Python. Originating from eatiht, the extraction algorithm works by making one simple assumption: data appear as collections of repetitive elements. You can read about the reasoning here.

Скачать

^* Extension для Google Chrome

Разрабатывая это приложение я хотел бы чтобы любой мог найти похожие инструменты, технологии, техники и приёмы так же легко, как если бы вы искали в Google "Ruby vs ..." или "Awesome Ruby"

— Корнев Руслан (@woto)

Или воспользуйтесь нашим Телеграм ботом для добавления упоминаний.

Подробнее