Skip to main content

HOW MANY "THE" IN A SENTENCE?

 

      We know "the" is the most occurring word in English language. I want to find out how many 'the' occurs in a sentence normally.  In a passage of 100 words, mostly 7 'the' occurs.  The probability of occurrence is 0.07.  Also, it is found that normally English sentence contains 19 words.

     The probability of occurrence of two 'the' in a sentence of 19 words is,
= C(19,2)* 0.07^2 * 0.93 ^17 = 24.4%
Let us understand the formula step by step.
1. The probability of 'the' not being present is 1-0.07 =0.93.  The probability of 'the' not being one of the 17 words is 0.93^17.
2. The chance of 'the' appearing two times is 0.07^2.
3. C(19,2) is called combination or binomial co-efficient.  It gives, the no.of ways two 'the' and the remaining 17 words can be arranged.  Refer foot-note.  All the three factors should be multiplied to get the correct probability.  It gives 0.244 or 24.4%.

     Similarly what is the chance of 'the appearing one time only in 19 word sentence.
= C(19,2) * 0.07 * 0.93 ^18 =0.36=36%
Again, the chance of 'the not being present even one time is
=C(19,0)*0.07*0.93^19 = 0.252=25.2%

Add all the three probabilities.
The zero time =25.2
       one time   =36.0
       two time  = 24.4
                          85.6%

Hence 'the ' appearing zero, one or two times in a sentence is more than 85%"the' occurring more than two times in a normal sentence is very unlikely.
     In similar way, we can analyze English text.  One more result is the average word length is 5 letters.  You can do text analysis using 'advanced text analysis' featured on the website English.com.
--------------------------------------------------------------------------------------------------
Foot note:
     How many ways you can arrange one A two B.
ABB, BAB, BBA - 3 ways
combination = Factorial 3/Factorial 1*factorial(3-1) = 3*2.1/1*1*2 =3
    This is how we find the no. of ways of arranging two 'the' and 17 words.  Online calculators are available to find combinations.    

Comments

Popular posts from this blog

LISSAJOUS FIGURES

  Definition:  "When a particle is subjected to two sine wave motion or two oscillatory motion at right angles, the particle describes lissajous figures".      We know sine wave motion and circular motion is basically same.  Hence we draw two circles A and B perpendicular to each other.  The circle B rotates twice faster than circle A.  That is, frequency of circle B is two times than that of A.        A particle at the intersection of two circles is subjected to two sine wave motion   A and B at 90 degree simultaneously.  The particle will describe figures depending on the frequency and phase of A and B .  In our case, the ratio of frequency is  1:2 and the two waves are in phase.        To draw lissajous figures :  A moving point in both the circles are chosen.   Here we should remember; during the time taken by the circle A to complete one rotation, circle B completes two.  Hence the points are marked on the circles according to their speed.  Then straight lines

THE PARABOLA

          A jet of water shooting from a hose pipe will follow a parabolic path.  What is the so special about parabola.    Y= x^2 Draw a graph for the above equation.  It will result in a parabola.  This parabola is also called unit parabola.  Any equation involving square will yield a parabola. Example:  Y = 2x^2 +3x+3 (also called quadratic equation)    X= 2 and -2, both  satisfies the equation 4 = X^2.  Parabolic equations always have two solutions.     Any motion taking place freely under gravity follows parabolic path. Examples:   An object dropped from a moving train,   A bomb dropped from flying plane,  A ball kicked upwards.      If a beam of light rays fall on the parabolic shaped mirror, they will be reflected and brought to focus on a point.  This fact is made use of in Dish Antenna, Telescope mirrors, etc.      Inverted parabola shape is used in the construction of buildings and bridges.  Because the shape is able to bear more weight.      A plane

CASINO'S GAME

           Let us find out how the casino survives with mathematics.      Say, your friend invite you for a game of dice.  You must bet (wager) 2 dollars.  If you roll 'six' you will get back 8 dollars.  The game will go on for 30 rounds.  All sounds good.      The probability of rolling 'six' is 1/6.  Since the game will be played for 30 times, the 'expected win' is 30*1/6 = 5.  That is, you are expected to win 5 rounds out of 30.  Hence your gain will be 5 * 8 =40 dollars.  ok.  This also implies that you will loose 25 rounds.  Hence your loss will be 25*2 =50 dollars.  Your net gain will be gain-less = 40-50 = -10 dollars. For 30 rounds, the loss is -10 dollars, Hence, for one round =-10/30 = -1/3 dollars.  There will be a loss of -1/3 or 0.33 dollars per round.  It is not a fair game.     Let us make a simple formula to calculate  'Pay out per round\. The probability for a win = p The pay-out in case of win = V No. of rounds = n The expect