Charles Explorer logo
🇬🇧

User-Friendly and Extensible Web Data Extraction

Publication at Faculty of Mathematics and Physics |
2018

Abstract

In this paper we present a new wrapping language-Serrano-that has three goals: (1) ability to run in a restricted environment, such as a browser extension, (2) extensibility to balance the tradeoffs between expressiveness of a command set and safety, and (3) processing capabilities to eliminate the need for additional programs to clean the extracted data. Serrano has been successfully deployed in a number of projects and provided competitive results.