Tag Archives: perl
I’m currently using an Italian hosting service, Webperte, for my Italian blog. The service is fine, except for one minor frustration: they don’t support ftp retrieval of the web server access logs used by log based web analytics systems such as Google’s Urchin.
The official solution is to log in to a hosting control panel, navigate a few screens and click on the log download links… a rather tedious process. Fortunately, the perl scripting language offers a relatively easy way to automate this process. I’ve hacked together a script, retrieve-hosting-logs.gz, which is designed to log in to a Parallels Business Automation control panel and download the two most recent access, error or ftp log files as desired. Feel free to use it at your own risk and don’t expect support you haven’t paid for .
- Hard code username and password or specify them on the command line
So you just wrote a stunning essay on James Joyce’s Ulysses – in Irish Gaelic. Will Yahoo!, Google, Microsoft and Ask recognize it as Gaelic, hosted as it is on your co.uk domain? Maybe. But you can given them a hint!
The trick is to use all of the HTTP and HTML language code settings available to your advantage to ensure your documents aren’t falsely identified. This article considers HTTP and HTML aspects of website internationalization for search engine optimization.
Why is Language Recognition a Problem?
Search engines try to match a web searcher’s language (based on ip geo location recognition or user specified preferences) to web documents when determining the best matches for a search query. In some cases, a user may specify that results be limited to a specific language. Left to their own devices, search engines have a few clues to determine the human language of a document: