Download boilerpipe-1.1.0.jar file

Description

The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page. The library already provides specific strategies for common tasks (for example: news article extraction) and may also be easily extended for individual problem settings. Extracting content is very fast (milliseconds), just needs the input document (no global or site-level information required) and is usually quite accurate. Boilerpipe is a Java library written by Christian Kohlsch?tter. It is released under the Apache License 2.0. The algorithms used by the library are based on (and extending) some concepts of the paper "Boilerplate Detection using Shallow Text Features" by Christian Kohlsch?tter et al., presented at WSDM 2010 -- The Third ACM International Conference on Web Search and Data Mining New York City, NY USA.

You can download jar file boilerpipe 1.1.0 in this page.

License

The Apache Software License, Version 2.0

Build File

You can use the following script to add boilerpipe-1.1.0.jar to your project.

<dependency>
   <groupId>de.l3s.boilerpipe</groupId>
   <artifactId>boilerpipe</artifactId>
   <version>1.1.0</version>
</dependency>

compile group: 'de.l3s.boilerpipe', name: 'boilerpipe', version: '1.1.0'

libraryDependencies += "de.l3s.boilerpipe" % "boilerpipe" % "1.1.0"

<dependency org="de.l3s.boilerpipe" name="boilerpipe" rev="1.1.0"/>

@Grapes(@Grab(group='de.l3s.boilerpipe', module='boilerpipe', version='1.1.0'))

'de.l3s.boilerpipe:boilerpipe:jar:1.1.0'

Download

Click the following link to download the jar file.

boilerpipe-1.1.0-javadoc.jar
boilerpipe-1.1.0-sources.jar
boilerpipe-1.1.0.jar
boilerpipe-1.1.0.pom

Download boilerpipe-1.1.0.jar file - Jar b

Description

License

Build File

Download

Related Tutorials