PYTHON
PROJECT – ENGLISH LANGUAGE EXPLORATION

 

 

DESCRIPTION


You will write a program that will analyze the content of a series of articles.


TASK


Find at least five lengthy articles in English on different topics.  Copy and paste them into a text file in Pycharm.

 

Answer the following predictive questions:

  • Which 5 letters do you think are the most commonly used?
  • Which 5 letters do you think are the least commonly used?
  • Do you expect similarities over different articles?  Which ones?
  • Do you expect differences over different articles?  Which ones?

 

Write a functions necessary to do the following:

 

  • Count total characters in each file.
  • Count total sentences in each file.
  • Count the average number of words per sentence.
  • Count the number of occurrences of each letter in the alphabet.

 

Run your code on each file.

 

Analyze the data and see if there is a consistency between different text files.  Create a document presenting your findings.

 

Present your findings to your teacher.

 

PRESENTATION

 

  • You will demonstrate your working code.

  • You will discuss the results and any consistencies and inconsistencies that you found in different files.

  • You should have several charts sharing information such as:
    • The most used letters
    • The least used letters
    • The percentage of occurrences of each letters
    • The average number of words in a sentence.
    • A comparison of the different files.

 

OTHER IDEAS

 

Consider adding a few files of a different language (same alphabet) or a different time period to see if that writing has specific differences/similarities.