Skip to main content

CAN YOU COIN A REAL WORD?

 

   Just try to create a five letter word randomly,
tidkl,cewkx,dmwol,
vuptg,hvwjk,naqid
    The words are not only meaningless but also difficult to pronounce.

    Actually, how many five letter word you can make in English language.
26*26*26*26*26 =11881376 words.
because, there are 26 choices for each position.
     English has nearly 200000 words.  There may be 40000 real meaningful 5 letter words.
40000/11881376 = 0.003 or 0.3%
    If you or a computer generates five letter words randomly, there is only 0.3% chance for getting real words.
    You see, coining a real word or name is very difficult.  People want new names for new born child, medicine, fictional characters etc.  Let us try to get real new names.

    1. All English words has vowels.  Hence, add a vowel or two deliberately in each words.
Now, actot, verrs,aglgo...
    Some words can be pronounced.  A little improvement.
    2. The frequency of each letter in English language is known and given below.

a b c d e f g h i j k l m n o p q r s t u v s t u v w x y z

82 15 28 42 127 22 20 61 70 2 8 40 24 67 75 19 1 60 63 90 27 10 24 2 20 1


     That is, if a passage contain 1000 letters, 'a' is likely to appear 82 times; 'b' 15 times, z one time and so on.
    Imagine die or dice having 1000 faces.  'a' is engraved on 82 faces in it; b on 15 faces; z on only one face and so on.  Now roll the multifaceted die 5 times.  each time note down the letter that appears and coin a five letter word.  (This thought experiment can easily be simulated by a computer).  Now, we get these kind of words.
    elnao, segty, least, soyie, laarm
    More important.  Most can be pronounced.  Some are close to real life words.  Some success.
    3. A given letter is likely to followed by certain letters.  For example, 't' is mostly followed by 'h'.  'o' is followed by 'a'.
    For each of 26 letters, we can find highly probable following letter.(including space)
    Incorporating this idea in the computer algorithm, we get this result.
    "the cur the bund hof arytowno....
Now, we get not words but sentences -of course-nonsense.
    4. Give a sample passage to the computer.
     1. Computer select a letter randomly say 't'
     2. Using the passage, computer will find out the letter which is most likely to         follow 't'.  It is 'h' we know.
     3. Again using the same passage computer will pick up a letter which is most        likely to follow the pair "th" . It is 'e'
    4. Next, computer may find out the letter that will follow the three letter                    combine 'the'
    This process can be extended up to 4 letter or 5 letter combine.
    The entire algorithm can be repeated again and again to coin 5 letter words.  Here is the result.
    "ther was just in time it all seemed quite natural.

   Looks good.  Some day computer may write a poem or even a epic.  
----------------------------------------------------------------

Comments

Popular posts from this blog

THE EARTH, A SUPER ORGANISM

     JOIN MY COURSE: "Become a programmer in a day with python"       A man called 'love lock' (what a name) proposed a theory called Gaia theory, named after Greek Goddess.      It says, "Earth is a self-regulating organism like a human being.  The organic life in it interacts with in-organic matter and maintains atmosphere, temperature and environment".  Hence the earth is still suitable for the life to thrive.      Imagine, in a particular place, there are lot of flowers.  Some flowers are white and some are darkly coloured.  We know, white reflects light and heat while dark absorbs the same.  White flowers can thrive in hot climate.  But dark flowers requires cold climate.  The absorption and reflection balances and the environment reaches average, warm temperature at which both the flowers can co-exist.  This is the essence of "Gaia" theory.      On our earth, the oxygen constitute 20% of the atmosphere.  The oxygen level is always mai

THE PARABOLA

          A jet of water shooting from a hose pipe will follow a parabolic path.  What is the so special about parabola.    Y= x^2 Draw a graph for the above equation.  It will result in a parabola.  This parabola is also called unit parabola.  Any equation involving square will yield a parabola. Example:  Y = 2x^2 +3x+3 (also called quadratic equation)    X= 2 and -2, both  satisfies the equation 4 = X^2.  Parabolic equations always have two solutions.     Any motion taking place freely under gravity follows parabolic path. Examples:   An object dropped from a moving train,   A bomb dropped from flying plane,  A ball kicked upwards.      If a beam of light rays fall on the parabolic shaped mirror, they will be reflected and brought to focus on a point.  This fact is made use of in Dish Antenna, Telescope mirrors, etc.      Inverted parabola shape is used in the construction of buildings and bridges.  Because the shape is able to bear more weight.      A plane

DISORDER IS THE "ORDER OF THE DAY"

         Imagine a balloon full of air.  The air molecules are moving randomly inside the balloon.  Let us pierce the balloon with a pin.  The air rushes out.  Why should not the air molecules stay inside the balloon safely and ignore the little hole?  That is not the way the world works.  The molecules always "want to occupy as many states as possible".  Hence the air goes out in the open to occupy more volume.   The things always goes into disorder (entropy) and the disorder increases with time.  The above statement is what we call "second law of thermodynamics".      Consider a cup of coffee on the table. Suppose the heat from entire room flows to your cup of coffee, the coffee will boil and the rest of the room will freeze.  Freezing means bringing things to order and arrangement.  It violates the second law.  Hence it will never happen.  Hence heat must flow from high temperature to low temperature and not the other way.        The air molecules in y