Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

PPI.pm

Compare

  Analyzed 31 minutes ago

Parse, Analyze and Manipulate Perl (without perl) The ability to read, and manipulate Perl (the language) programmatically other than with perl (the application) was one that caused difficulty for a long time. The cause of this problem was Perl's complex and dynamic grammar. Although there is ... [More] typically not a huge diversity in the grammar of most Perl code, certain issues cause large problems when it comes to parsing. Indeed, quite early in Perl's history Tom Christenson introduced the Perl community to the quote "Nothing but perl can parse Perl", or as it is more often stated now as a truism: "Only perl can parse Perl" PPI tries to solve this problem. [Less]

14.1K lines of code

6 current contributors

about 1 month since last commit

29 users on Open Hub

Low Activity
4.44444
   
I Use This
Licenses: Artistic_..., gpl

Sparse

Compare

  Analyzed about 16 hours ago

Sparse, the semantic parser, provides a compiler frontend capable of parsing most of ANSI C as well as many GCC extensions, and a collection of sample compiler backends, including a static analyzer also called "sparse". Sparse provides a set of annotations designed to convey semantic information ... [More] about types, such as what address space pointers point to, or what locks a function acquires or releases. [Less]

47.8K lines of code

12 current contributors

3 months since last commit

24 users on Open Hub

Low Activity
4.6
   
I Use This

Apache Tika

Compare

Claimed by Apache Software Foundation Analyzed 1 day ago

The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.

392K lines of code

19 current contributors

2 days since last commit

23 users on Open Hub

Very High Activity
5.0
 
I Use This

feedparser

Compare

  Analyzed 24 minutes ago

Parse Atom and RSS feeds in Python

18.8K lines of code

5 current contributors

7 days since last commit

18 users on Open Hub

Low Activity
5.0
 
I Use This

PyYAML

Compare

  Analyzed 1 day ago

YAML is a data serialization format designed for human readability and interaction with scripting languages. PyYAML is a YAML parser and emitter for the Python programming language. PyYAML features - a complete YAML 1.1 parser. In particular, PyYAML can parse all examples from the ... [More] specification. The parsing algorithm is simple enough to be a reference for YAML parser implementors. - Unicode support including UTF-8/UTF-16 input/output and \u escape sequences. low-level event-based parser and emitter API (like SAX). - high-level API for serializing and deserializing native Python objects (like DOM or pickle). - support for all types from the YAML types repository. A simple extension API is provided. - relatively sensible error messages. [Less]

8.26K lines of code

3 current contributors

5 months since last commit

18 users on Open Hub

Very Low Activity
3.75
   
I Use This
Tags parser python

Spirit Parser Library

Compare

  Analyzed about 8 hours ago

Spirit is an object-oriented, recursive descent parser generator framework implemented using template meta-programming techniques. Expression templates allow Spirit to approximate the syntax of Extended Backus Normal Form (EBNF) completely in C++. The Spirit framework enables a target grammar to be ... [More] written exclusively in C++. EBNF grammar specifications can mix freely with other C++ code and, thanks to the generative power of C++ templates, are immediately executable. [Less]

193K lines of code

13 current contributors

about 2 months since last commit

17 users on Open Hub

Low Activity
4.55556
   
I Use This

Beautiful Soup

Compare

  No analysis available

A Python HTML/XML parser for quick turnaround projects like screen-scraping. 1. Beautiful Soup won't choke if you give it bad markup. It yields a parse tree that makes approximately as much sense as your original document. This is usually good enough to collect the data you need and run away. 2. ... [More] Beautiful Soup provides a few simple methods and Pythonic idioms for navigating, searching, and modifying a parse tree: a toolkit for dissecting a document and extracting what you need. You don't have to create a custom parser for each application. 3. Beautiful Soup automatically converts incoming documents to Unicode and outgoing documents to UTF-8. You don't have to think about encodings, unless the document doesn't specify an encoding and Beautiful Soup can't autodetect one. [Less]

0 lines of code

1 current contributors

0 since last commit

17 users on Open Hub

Activity Not Available
4.25
   
I Use This
Mostly written in language not available
Licenses: BSD-3-Clause

nokogiri

Compare

  No analysis available

Nokogiri is a libxml wrapper. It features an HTML, XML, SAX, and Reader parser, as well as XPath and CSS interfaces for searching. Nokogiri is also a drop in replacement for Hpricot.

0 lines of code

14 current contributors

0 since last commit

13 users on Open Hub

Activity Not Available
5.0
 
I Use This
Mostly written in language not available
Licenses: No declared licenses

args4j

Compare

  Analyzed about 13 hours ago

args4j is a small Java class library that makes it easy to parse command line options/arguments in your CUI application.

5.96K lines of code

1 current contributors

about 1 month since last commit

12 users on Open Hub

Moderate Activity
5.0
 
I Use This

Texy!

Compare

  No analysis available

Texy is one of the most complex lightweight markup language. It allows adding of images, links, nested lists, tables and has full support for typography and CSS. Texy allows you to enter content using an easy to read Texy syntax which is filtered into structurally valid XHTML. No knowledge of HTML is required.

0 lines of code

1 current contributors

0 since last commit

12 users on Open Hub

Activity Not Available
4.83333
   
I Use This
Mostly written in language not available
Licenses: gpl, gpl3