Month: July 2012

  • The End of a World : 21 December 2012

    Well, it is a quite current subject, as I can find many things about it just through making a search on Google. So after having been captivated by this date for a few days, I will give you a probable version of it. Because I think something will really happen on December 21, or not.

    My scenario of this 21st December is a digital holocaust (I have found no satisfying term to describe it) making mechanically other events happen: one country, which has build a defense against other country’s attack is attacked. This system strikes back by attacking the country from which the attack has been lead, leading the Internet connection to be overflooded and thus unable to complete anything for hours. But the system has a side effect making it strike another country. The latter attacks but due to side effect, other countries are also affected. At each attack the entire bandwidth of a country is affected to “war effort”.

    Sure, during that date, many things will happen: many sects on this world will commit collective suicide or occupy massively a place that is not made for. Many nations will have their Internet connection cut, so they can not be aware of what’s happening, even in their own country. Most people will be unable to telephone due to the Net War. Everyone will be disconnected to each other, and breaking news won’t have many readers.

    A financial crisis due to internet shutdownCut to the rest of the world, the markets of London, Paris, Tokyo or New York will go mad: the price of precious and rare raw materials is increasing, making the insecurity growing: you will likely to be killed for detaining on you precious items such as telephone, or golden artefacts (rings, necklaces…). In response to this, the government will increase the number of policemen operating in cities, but this has quite huge repercussion on the State’s budget, making national debt increase. Debt increase is not appreciated by the markets, and already very fragile States will be going to lack money to pay all their civil servants. To avoid this, taxes will be increased in order to have some funds to make the State and all its institution work… temporarily.

    Tax increases are never appreciated, even more in economic crisis. In fragile countries (Spain, Greece, Portugal…), the situation will turn into civil war. As money got from taxes goes decreasing. As there is no money anymore to pay them, civil servants are fired, and become officially unemployed. Actually many people will do unreported employment. For the few people still working reportedly, dissatisfaction towards the government policy will grow, leading to more and more frequent and violent demonstrations. To be able to contain demonstrations, the police will become more and more violent, making demontrations become carnage and bloodshed. But due to markets holding governments by their balls tax will increase again and again. But without any effect as already more than 80% of people have at least one unreported work. This is a great loss for taxes, so these 80% will be stalked by the government, leading to civil war.

    Overflooded by stalking workers trying to survive, and by robbers breaking everyone’s home, policemen no longer grant security in cities. So more and more people leave these cities for the surrounding countrysides. People will be organised in small communities, with their own militia to defend the community’s property. The State no longer exists. It is the end of a word: the one we knew before.

  • Search on Google using Python scripts

    What about a free unlimited Google API? In the past, Google provided such thing, but it is definitely deprecated (due to abuses?). The new Search API needs money ($5 for 1,000 queries), and the free API has a limited use of 100 queries per day. Without any money, you won’t get far. After getting that information. I let down that project… Until I contribute to Wiktionary!

    Extracting words from Malagasy daily newspapers to Malagasy Wiktionary weren’t actually an easy thing to program. At the first version of the script. It only can parse RSS feeds, and is very slow compared to what I used to know. It is because it loads approx. 400,000 words at each launch.
    While doing that work. I have noticed that there are a plenty of words that are actually compounded words.This notice gave me an idea: anticipate through looking on google search whether the word exists or not: because on 1,300 roots contained on the Malagasy Wiktionary, I can potentially make 1.7 million by combining two nouns,  2.2 billion with three, and likely 2.8 trillion using four roots. That is enormous, and even at full regime, I will never be able to look for them all: at 5 queries per second (fastest rate I’ve ever had) it will take respectively 4 days with 2 roots, 14 years with three and eventually 177 centuries (17,700 years) for four roots. This is the first reason for which I have decided to try hacking Google Search to see if the word combination has already been used.

    First, I looked to the page source, and it is very, very complicated to understand. I even think that this page was made by bot as html tag names are not written in a human language. I also have tried to use the URL but it is actually very, very long, with characters that look more like hashes and keys (?), not findable as they don’t explicitly appear on the main page form. At first sight, this kind of project is likely to fall…

    I have found on the Web a post describing how to use the Google Search without any API. But there was a problem: the discussion is almost three years old. And when downloaded, the search engine has visibly been changed: it is very probable that a Google employee reported that discussion leading the company to take adequate measures. When I ran the script, all I could see was that there was nothing operational: no results were given when doing any search. I still keep an eye on the downloaded script. And I am trying to find something which can solve this problem. This script just avoided me to spend hours and hours reinventing a (square) wheel.

    Once this problem is solved, at least temporarily, the source code will be released on SourceForge: Bot-Jagwar. It will rapidly fall into deprecation, so if there are peoples willing to update the script. They’ll be welcome :).