Einfache Projektliste Software-Karte

Text Processing
1877 Projekte im Ergebnis
Letztes Update: 2011-08-03 20:07

DSPAM

DSPAM is a server-side statistical anti-spam agent for Unix email servers. It masquerades as the email server's local delivery agent and effectively filters spam using a combination of de-obfuscation techniques, specialized algorithms, and statistical analysis. The result is an administratively maintenance-free, self-learning anti-spam tool. DSPAM has yielded real-world success rates beyond 99.9% accuracy with less than a 0.01% chance of false positives.

Letztes Update: 2013-12-21 08:03

GNU TeXmacs

GNU TeXmacs is a free wysiwyw (what you see is what you want) editing platform with special features for scientists. The software aims to provide a unified and user friendly framework for editing structured documents with different types of content: text, mathematics, graphics, interactive content. TeXmacs can also be used as an interface to many external systems for computer algebra, numerical analysis, and statistics. New presentation styles can be written by the user and new features can be added to the editor using Scheme.

Letztes Update: 2014-05-22 23:47

GNU Parallel

GNU parallel is a shell tool for executing jobs in parallel locally or using remote computers. A job is typically a single command or a small script that has to be run for each of the lines in the input. The typical input is a list of files, a list of hosts, a list of users, a list of URLs, or a list of tables. If you use xargs today you will find GNU parallel very easy to use, as GNU parallel is written to have the same options as xargs. If you write loops in shell, you will find GNU parallel may be able to replace most of the loops and make them run faster by running several jobs in parallel. GNU parallel makes sure output from the commands is the same output as you would get had you run the commands sequentially. This makes it possible to use output from GNU parallel as input for other programs.

Letztes Update: 2012-12-25 13:15

FBReader

FBReader is an e-book reader. It currently works on the Sharp Zaurus, Siemens Simpad with Opensimpad ROM, Nokia Internet Tablet (Maemo platform), Archos PMA430, Motorola E680i/A780/A1200 smartphones, PepperPad 3, Asus Eee PC, IRex iLiad, UMPC, and desktop computers running Linux, Windows XP/Vista, or FreeBSD. It supports several e-book formats: epub, plucker, palmdoc, zTXT, HTML, CHM, fb2, TCR (psion text), OEB, OpenReader, RTF, non-DRM'ed Mobipocket, and plain text.

Letztes Update: 2014-01-06 21:49

HTMLDOC

HTMLDOC converts HTML files and Web pages into indexed HTML, PostScript, and PDF files suitable for online viewing and printing. It can be used as a standalone GUI application, in a batch document processing environment, as a Web-based report generation application, or in embedded environments to support printing of HTML content. It runs on all Unix platforms as well as Mac OS X and Windows 2000 and higher.

Letztes Update: 2014-01-12 23:10

GNU m4

GNU m4 is an implementation of the traditional Unix macro processor. It is mostly SVR4 compatible, although it has some extensions (for example, handling more than 9 positional parameters to macros). GNU m4 also has built-in functions for including files, running shell commands, doing arithmetic, etc. Autoconf needs GNU m4 for generating configure' scripts, but not for running them.

Letztes Update: 2014-05-25 18:13

XMLTV

XMLTV is a set of programs to obtain and process TV (tvguide) listings and manage your TV viewing. It stores the listings in an XML-based format and most of the programs are filters which read and/or write XML. It includes tools to obtain, sort, grep, print, and munge listings, and two end-user programs to plan a week's TV viewing.

Letztes Update: 2007-04-11 09:12

PHP Content Management System

PHP Content Management System (phpCMS) makes it possible to need only one template for your whole Web site. It allows you to provide dynamic menus with unlimited levels, and use templates and sub-templates without a database. It is search engine-friendly and proxy-friendly, as the pages it generates can not be distinguished from static HTML pages. PHP code can be added to any template and content file with an optional module. It supports the caching of parsed pages and gzip compression.

Letztes Update: 2005-02-07 01:44

teTeX

teTeX is the de facto standard TeX distribution for a wide range of Unix-type operating systems, and serves as a building block for others. It's easy to install and customize. teTeX is not maintained anymore; instead, TeX users should migrate to TeXlive.

Letztes Update: 2014-06-12 08:55

TXR

TXR is a new data munging language. TXR's special pattern language provides template-based matching of entire documents or large sections of documents. It also contains a language for functional and imperative programming. It is written in C and takes the form of a utility that is portable to Unix-like platforms and Windows.

Letztes Update: 2005-01-06 00:38

GNU Aspell

GNU Aspell is a spell checker designed to eventually replace Ispell. It can either be used as a library or as an independent spell checker. Its main feature is that it does a superior job of suggesting possible replacements for a misspelled word than just about any other spell checker out there for the English language. Unlike Ispell, Aspell can also easily check documents in UTF-8 without having to use a special dictionary. Aspell will also do its best to respect the current locale setting. Other advantages over Ispell include support for using multiple dictionaries at once and intelligently handling personal dictionaries when more than one Aspell process is open at once.

Letztes Update: 2014-02-17 20:09

iText

iText is a library that contains classes to generate and manipulate documents in the Portable Document Format (PDF). Document manipulation includes splitting, merging, and filling out forms (AcroForms, static and dynamic XFA forms).

Letztes Update: 2009-10-28 23:02

TinyMCE

TinyMCE is a platform-independent, Web-based, Javascript HTML WYSIWYG editor control. Its main features include theme/template support, language support, and plugin extensions.

Letztes Update: 2005-11-28 06:18

antiword

Antiword is a free MS-Word reader for Linux, RISC OS, and DOS. It converts the documents from Word 2, 6, 7, 97, 2000, 2002, and 2003 to text, Postscript, and XML/DocBook. Antiword tries to keep the layout of the document intact.

Letztes Update: 2014-04-01 23:18

gjots

gjots lets you organize text notes in a convenient, hierarchical way. It can be used for notes, jottings, bits and pieces, recipes, and even PINs and passwords, using encryption. It can also be used to "mind-map" larger compositions like manuals, Web pages, articles, etc. It is a bit like the KDE program "kjots", but uses the GTK library and supports a hierarchy of folders. Files can be output to HTML with an automatic table of contents or to docbook XML. Encryption is supported with ccrypt(1), gpg(1), and openssl(1), so that musings can be kept private.