CSE 231 Summer 2018
Computer Project #3
Assignment Overview
This assignment focuses on the design, implementation and testing of a Python program to analyze
some real data using strings, files, and functions.
It is worth 75 points (7.5% of course grade) and must be completed no later than 11:59 PM on
Tuesday, June 5.
Assignment Deliverable
The deliverable for this assignment is the following file:
proj03.py – your source code program
Be sure to use the specified file name and to submit it for grading via the Mirmir system before the
project deadline.
Assignment Background
We collected national GDP (Gross Domestic Project) data from the Bureau of Economic Analysis (part
of the U.S. Department of Commerce) for the years 1969 through 2015. The GDP is one of the
primary indicators used to gauge the health of a country's economy. It represents the total dollar value
of all goods and services produced over a specific time period; you can think of it as the size of the
economy. When the economy is healthy GDP is increasing, but if GDP is decreasing, that can indicate
problems. Therefore, the change in GDP is useful to examine and that is the goal of this project.
We put the data in a file named GDP.txt that is available in the project directory. The file contains
data on the annual change in GDP which may be positive or negative and the GDP value itself for each
of the years 1969 through 2015.
If you are interested in where we got our data, here is the link:
http://www.bea.gov/iTable/iTable.cfm?ReqID=9&step=1#reqid=9&step=1&isuri=1
(One note about the data. GDP data gets adjusted in a variety of ways. I have extracted two tables
from a few thousand lines of tables in the original file. If you try to derive the annual change in GDP
from the data on line 44, it will differ by as many as a few percentage points from the change data we
are using from line 9. That is, some adjustments are being done that we are ignoring for this
assignment.)
Assignment Specifications
The lines of interest in the file GDP.txt are the 9th line which has the annual change in GDP and the
44th line which has the value of GDP for each year. The data starts in column 76 and each of the 47
data items spans 12 columns (there are 47 years inclusively from 1969 through 2015). These numbers
are important because you can use string slicing to extract them. For example, the first data item in a
line can be extracted using the slice line[76:76+12].
Your task is to find the minimum and maximum change in GDP for the years 1969 through 2015 and
to find the GDP value for those two years. See the sample output below. Your program will prompt
for an input file and then display the output.
You must use specific functions as described below and you are not allowed to use collections such as
lists, tuples and dictionaries.
Mirmir tests: I can provide any file with the same format as the provided GDP.txt—the number of
rows will be the same (so you can select lines 9 and 44), but the number of columns may be different
(i.e. different years).
Assignment Notes
Divide-and-conquer is an important problem solving technique. In fact, inside of every challenging
problem is a simpler problem trying to get out. Your challenge is to find those simpler problems and
divide the problem into smaller pieces that you can conquer. Functions can assist in this approach.
We provide skeleton code on Mirmir that has all the function skeletons for you to fill in.
Hint: build the following functions one at a time and test each one before moving on to the next.
1. The open_file()function takes no arguments and returns a file pointer. It repeatedly
prompts for file names until a file successfully opens. Use the try-except construct
checking for the FileNotFoundError exception. You do not need to check that the
filename is correct—simply check that the file opened. The simplest version of this function
has no error checking so the body of the function is simply
fp = open(“GDP.txt”)
return fp
Hint: Start with that as your function body and add the try-except error checking later.
2. The find_min_percent(line) function takes one argument, one line (str) from the
GDP.txt file. It returns the minimal value (float) in that line and an index (int) indicating
where the minimal value is in the line (you get to decide what value is the index—there are
multiple possibilities). You can return two values simply by separating them by commas:
return min_value, min_value_index
Here is an algorithm to find a minimal value of a series of values that you read. The algorithm
is written in pseudo-code so it looks somewhat like Python, but needs details of Python to be
complete:
min_value = 10000000 # some large value
for each value
if value < min_value: # you have found a smaller value
min_value = value # set min_value to that smaller value
Hint: Start by having your function simply print all the values in the line. That way you can
check that you are examining all the values. Once you can print all the values you can apply
the minimum algorithm to find the minimal value.
3. The find_max_percent(line) function takes one argument, one line from the
GDP.txt file. It returns the maximum value in that line and an index indicating where the
maximum value is in the line. This function is nearly identical to the find_min_percent
function. Instead of starting with a large value you start with a small value and reverse the
operator in the Boolean expression.
4. The find_gdp(line,index) function takes two arguments, one line from the GDP.txt
file and an index into the line. It returns the GDP value in that line at the specified index.
Note that this function can be used to find the GDP value associated with the minimal percent
change as well as the maximal percent change—simply call the function with a different index.
5. The display(min_val, min_year, min_val_gdp, max_val, max_year,
max_val_gdp) function takes six arguments: the minimal value, year and GDP, and the
same three for maximum (value, year, GDP). The function does not return anything. It
displays the values as shown in the sample below. Note that the GDP values in the file are in
billions whereas we expect you to convert billions to trillions for the output.
Use the following format string for displaying values:
"{:<10s}{:>8.1f}{:>6d}{:>18.2f}"
6. The main() function loops through the file to access lines 9 and 44, and sets up values for the
call to display function.
7. You are not allowed to use collections such as list, tuples and dictionaries. Specifically, do not
use the string .split() method—it returns a list. Use string slicing to extract values from
lines in the file.
8. You will be responsible for adhering to items 1-6 of the Coding Standard.
Suggested Procedure
• Solve the problem using pencil and paper first. You cannot write a program until you have
figured out how to solve the problem. This first step can be done collaboratively with another
student. However, once the discussion turns to Python specifics and the subsequent writing of
Python, you must work on your own.
• Use Anaconda to create a new program. Use the required file name (proj03.py).
• Write a simple version of the program, e.g. open the file and print all the lines in the file. Run
the program and track down any errors. Add one function and retest.
• Use the Mirmir system to turn in the first version of your program.
• Cycle through the steps to incrementally develop your program:
o Edit your program to add new capabilities.
o Run the program and fix any errors.
• Use the Mirmir system to submit your final version.
Sample Output
Test 1
Enter a file name: xxxx
Error. Please try again
Enter a file name: GDP.txt
Gross Domestic Product
min/max change year GDP (trillions)
min -2.8 2009 14.42
max 7.3 1984 4.04
Grading Rubric
Computer Project #3 Scoring Summary
General Requirements
__0__ ( 5 pts) Coding standard
Program Implementation
__0__ (10 pts) Function (no Mirmir test): open_file
__0__ (10 pts) Function Test 2: find_min_percent
__0__ ( 5 pts) Function Test 3: find_max_percent
__0__ (15 pts) Function Test 4: find_gdp
__0__ (10 pts) Function (no Mirmir test): display
__0__ (20 pts) Test 1
版权所有:编程辅导网 2021 All Rights Reserved 联系方式:QQ:99515681 微信:codinghelp 电子信箱:99515681@qq.com
免责声明:本站部分内容从网络整理而来,只供参考!如有版权问题可联系本站删除。