Tag Archives: perl

Automatically download log files from hosting control panel

I’m currently using an Italian hosting service, Webperte, for my Italian blog. The service is fine, except for one minor frustration: they don’t support ftp retrieval of the web server access logs used by log based web analytics systems such as Google’s Urchin.

The official solution is to log in to a hosting control panel, navigate a few screens and click on the log download links… a rather tedious process. Fortunately, the perl scripting language offers a relatively easy way to automate this process. I’ve hacked together a script, retrieve-hosting-logs.gz, which is designed to log in to a Parallels Business Automation control panel and download the two most recent access, error or ftp log files as desired. Feel free to use it at your own risk and don’t expect support you haven’t paid for :-).

Primary Features

  1. Hard code username and password or specify them on the command line
Comments Off

People, Internet and Enterprise Business, all without mentioning Google.

Last Wednesday I had the fortune to attend a world class conference on social behavior and technology applied to medium and large sized businesses. Not in San Francisco. Not in Boston, where I worked for 4 years. Not in Milan, even. In Varese. Right, Varese, once known more for shoe production. The conference, the International Forum on Enterprise 2.0, was held at L’Università dell’Insubria as part of their 10th anniversary celebration.

As a search marketing consultant, I was very interested in how the social web is being applied to business environments. The very intertwined nature of the web means that no web marketing project should be seen in isolation. <rant>Thanks to the kind folks at Trenitalia, who canceled my train from Tuscany at the last minute, I almost didn’t make it. Not that you’d find any news about this on their website.</rant>


How to Specify an HTML Web Document Language for good SEO

So you just wrote a stunning essay on James Joyce’s Ulysses – in Irish Gaelic. Will Yahoo!, Google, Microsoft and Ask recognize it as Gaelic, hosted as it is on your co.uk domain? Maybe. But you can given them a hint!

The trick is to use all of the HTTP and HTML language code settings available to your advantage to ensure your documents aren’t falsely identified. This article considers HTTP and HTML aspects of website internationalization for search engine optimization.

Why is Language Recognition a Problem?

Search engines try to match a web searcher’s language (based on ip geo location recognition or user specified preferences) to web documents when determining the best matches for a search query. In some cases, a user may specify that results be limited to a specific language. Left to their own devices, search engines have a few clues to determine the human language of a document:

1 Comment

Unofficial documentation of Ask’s Web Search API

In part one of this article, we set out to document the little known Ask web search API by providing background information. In this continuation, we’ll look at the actual API details.

Note Update: Ask disabled access to their API on 6 March 2007. We are working on obtaining additional information. Write us if you would like to be notified of further developments.

NoteThe following information was determined by observation and conjecture. Write us if you want to be notified when we update this page with more complete information. We are assuming the reader has already worked with REST queries and is familiar with parsing XML data.

Request URL

The request URL is formed by adding query parameter and their values to a base URL using the format query parameter=values. Successive parameters are added using a & before each parameter.

Base URL: http://xml.teoma.com/e?

Request URL parameters should be URL encoded.