! Aware >
under Open source license >
Activity specific > Information Tools > WWW > Robots and Proxies >
WWW robots and proxiesSubsets on this page: - #Apps & Utilities - #Libs & Functions - Change Selections: Use Defaults. - #Personalize - |
| ||
Home By TONY By MARK By JERRY By ANN By ERICA Subjects By activity User Interface Text Strings Math Processing
Stored Data
Communications
Hard World File System
|
Dead Link Check (DLC) - DLC - HTTP link checker written in Perl. Can generate HTML output for easy checking of results and process a link cache file to hasten multiple requests. Initially created as an extension to Public Bookmark Generator (PBM); can be used alone. {(L)GPL}
site-dater.pl - Generates a table of web links within a local hierarchy sorted by date. {PD}
SiteMap - Creates an HTML SiteMap of your *.*htm* files {GPL}
ht://Dig - Complete world wide web indexing and searching system {GPL}
Checklinks - HTML link checker that supports SSI, many Apache options, and more (in Perl 5) {OpenSource}
PerlLeech - The program will given a set of keywords and file extensions go out to a set of search engines and search for files and download these. You will be able to specify the maximum recursive page downloads. {(L)GPL}
Web Resource Application Framework - Wraf implements a RDF API that hopes to realize the Semantic Web. The framework uses RDF for data, user interface, modules and object methods. It uses interfaces to other sources in order to integrate all data in one enviroment, regardless of storage f {(L)GPL}
mebay - MeBay is a Perl/GTK client for eBay with support for "My eBay" bid and watch items, and support for several types of item searching. Item images can also be displayed when possible. {(L)GPL}
Lucrezia cover traffic system - Simulates the behaviour of a human Web surfer by downloading pages, filling in forms, etc. and leaking realistic "personal information" to prevent marketers and other snoopy persons from tracking the behaviour of real human users. {oss}
Yet Another Ticker - YAT is another ticker program. However, it uses Yahoo's excellent and comprehensive news feed to create a ticker that can be read throughout the day. {(L)GPL}
Headlines - Headlines is an application to combine all the Internet news feeders in one place. {(L)GPL}
DelphiRSS - DelphiRSS is a set of native Delphi VCL components which allow you to write applications that use and display RSS headlines. It incorporates a RSS parser and a component to retrieve RSS files via HTTP {oss}
KOffle - KOffle is a tool for managing your wwwoffle-spools (http and ftp) and your outgoing requests. {(L)GPL}
notify - Notify (website) visitors of changes to your site. {GPL}
CheckURL - Sends notification e-mails for changed URLs {GPL}
DejaSearch - DejaSearch is a frontend to DejaNews, the leading Usenet archive {GPL}
Web Secretary - Web page monitoring software {GPL}
netcomics - A perl script that downloads today's comics from the Web {GPL}
sitecopy - Maintain remote copies of locally stored web sites {GPL}
DraE Tracking - Allows servers to provide free tracking to web sites {GPL}
FastLink - FastLink is a free Java Applet that displays mirror sites sorted by their respon {GPL}
The Internet Junkbuster - The Internet Junkbuster v2.0.2 {GPL}
EHeadlines - Root Menu news system. {x,GPL}
gtkMeat - A Freshmeat new submissions ticker {x,GPL}
gtkSlash - Gtk+ based Slashdot headlines news ticker {x,GPL}
Kget - KDE app to get files from the internet {x,GPL}
asScotch - The days UserFriendly comic strip in your AfterStep rootmenu {x,GPL}
asTequila - The AfterStep Resource Page (TARP) headlines in your AfterStep rootmenu {x,GPL}
Squid - High performance Web proxy cache {GPL}
w3mir - HTTP copying and mirroring program {Artistic}
WWWOFFLE - Simple proxy server with special features for use with dial-up internet links {GPL}
freshmeat newsletter to HTML converter - procmail filter to convert freshmeat email newsletter to HTML {Artistic}
webcrawl {PD}
ECLiPt-Mirror - Full-featured mirroring script {GPL}
pavuk - Webgrabber with an optional Xt or GTK GUI {GPL}
snarf - Command-line URL retrieval tool with some unique features. {GPL}
ticker - Configurable text scroller, with slashdot and freshmeat modules {GPL}
curl - Tiny command line client for getting data from a URL {GPL}
swebget - Prints a webpage to stdout {GPL}
GNU Wget - Network utility to retrieve files from the World Wide Web {GPL}
PathFinder - A personal web search engine {GPL}
HTTPGate - A Filtering HTTP Gateway {GPL}
Muffin - Filtering proxy server for the World Wide Web written entirely in Java {GPL}
tinyproxy - A small, lightweight, easy-to-configure HTTP proxy. {GPL}
Internet Junkbuster - Blocks unwanted banner ads and protects your privacy {GPL}
Kticker - News ticker widget that downloads news headlines and displays them periodically {x,GPL}
urlredir - URL redirector for use with the squid proxy server {GPL}
DailyUpdate - Grabs dynamic information from the internet and integrates itinto your webpage {GPL}
Web User Interface - Builds a list of all available personal homepages. {GPL}
CGIProxy - Anonymizing, filter-bypassing HTTP proxy in a CGI script (in Perl) {OpenSource}
Get Right - HTTP resume for failed transfers. {GPL}
Web Tree Scanner - A program to visualize the tree of a WWW server and check the links [X] {GPL}
Slashdot Reader - Slashdot Reader written in Pike/GTK. [X] {PD}
httptunnel-3.3 - Tunnel a tcp/ip connection through a http/tcp/ip connection
asGin - Linux Today headlines in your AfterStep root menu [X] {GPL}
urlmon - URL monitoring and report tool {GPL}
python web library - Powerful, lightweight web library for python. These modules emulate the Request and Response objects from ASP, and Sess, Auth, Perm, and UserSess from PHPLIB. The sensible alternative to Zope! :) {oss}
curl_version - Return the current CURL version
curl_close - Close a CURL session
curl_exec - Perform a CURL session
curl_setopt - Set an option for a CURL transfer
curl_init - Initialize a CURL session
HTTP::Status - Processes status codes sent over HTTP, e.g. "403 Forbidden", "4040 Not Found", or "402 Payment required". Part of the libwww bundle. [Perl] {oss}
LWP::RobotUA - Create your own Web robot. Part of the libwww bundle. [Perl] {oss}
WWW::Robot - A traversal engine for your Web robot. [Perl] {oss}
WWW::RobotRules - Nice Web robots, as they scour the Net for treasure, heed a robots.txt file if they find one. Information about the Robot standard can be found in http://info.webcrawler.com/mak/projects/robots/norobots.html. [Perl] {oss}
ARS - A Web client for Remedy's ARS system. Useful only if you're already using ARSPerl. [Perl] {oss}
Related Subjects (under Open source license) |
(The following links to subjects at this site retain your personalized selections.)
WWW Servers - Respond to HTTP requests
WWW authoring - Creating HTML, CGI
WWW Browsers - User interface for accessing the WWW
Up to: World Wide Web - HTTP, HTML, standards, browsers, transfer utilities, servers, et al.
Personalized Selections | |||
Use our system: Bring Rapid Knowledge Transfer and Awareness to your company website! |