agentharbor commited on
Commit
9cf039d
·
verified ·
1 Parent(s): e1fe61c

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +2 -15
app.py CHANGED
@@ -25,7 +25,7 @@ According to the features you have and the data structure given below, determine
25
  Then list 5 interesting questions that could be asked on this data, for instance about specific correlations with target variable.
26
  Then answer these questions one by one, by finding the relevant numbers.
27
  Meanwhile, plot some figures using matplotlib/seaborn and save them to the (already existing) folder './figures/': take care to clear each figure with plt.clf() before doing another plot.
28
-
29
  In your final answer: summarize these correlations and trends
30
  After each number derive real worlds insights, for instance: "Correlation between is_december and boredness is 1.3453, which suggest people are more bored in winter".
31
  Your final answer should be a long string with at least 3 numbered and detailed parts.
@@ -39,20 +39,7 @@ The data file is passed to you as the variable data_file, it is a pandas datafra
39
  DO NOT try to load data_file, it is already a dataframe pre-loaded in your python interpreter!
40
  """
41
 
42
- example_notes="""This data is about the Titanic wreck in 1912.
43
- The target figure is the survival of passengers, notes by 'Survived'
44
- pclass: A proxy for socio-economic status (SES)
45
- 1st = Upper
46
- 2nd = Middle
47
- 3rd = Lower
48
- age: Age is fractional if less than 1. If the age is estimated, is it in the form of xx.5
49
- sibsp: The dataset defines family relations in this way...
50
- Sibling = brother, sister, stepbrother, stepsister
51
- Spouse = husband, wife (mistresses and fiancés were ignored)
52
- parch: The dataset defines family relations in this way...
53
- Parent = mother, father
54
- Child = daughter, son, stepdaughter, stepson
55
- Some children travelled only with a nanny, therefore parch=0 for them."""
56
 
57
  def get_images_in_directory(directory):
58
  image_extensions = {'.png', '.jpg', '.jpeg', '.gif', '.bmp', '.tiff'}
 
25
  Then list 5 interesting questions that could be asked on this data, for instance about specific correlations with target variable.
26
  Then answer these questions one by one, by finding the relevant numbers.
27
  Meanwhile, plot some figures using matplotlib/seaborn and save them to the (already existing) folder './figures/': take care to clear each figure with plt.clf() before doing another plot.
28
+ Generate a summary of each of the plot generated.
29
  In your final answer: summarize these correlations and trends
30
  After each number derive real worlds insights, for instance: "Correlation between is_december and boredness is 1.3453, which suggest people are more bored in winter".
31
  Your final answer should be a long string with at least 3 numbered and detailed parts.
 
39
  DO NOT try to load data_file, it is already a dataframe pre-loaded in your python interpreter!
40
  """
41
 
42
+ example_notes="""This data is about the Titanic wreck in 1912. I am interested in the survival statistics."""
 
 
 
 
 
 
 
 
 
 
 
 
 
43
 
44
  def get_images_in_directory(directory):
45
  image_extensions = {'.png', '.jpg', '.jpeg', '.gif', '.bmp', '.tiff'}