Due: Friday, October 16 at 11:59 pm PT
Objective: This assignment will give you experience using a Google Colab notebook to write text and code, including variables, mathematical operations, and string manipulations.
Instructions:
Honor code: In the space below, you can acknowledge and describe any assistance you've received on this assignment, whether that was from an instructor, classmate (either directly or on Piazza), and/or online resources other than official Python documentation websites like docs.python.org or numpy.org. Alternatively, if you prefer, you may acknowledge assistance at the relevant point(s) in your code using a Python comment (#).
Acknowledge assistance here:
Would it make the most sense to use a Colab notebook ('A'), Jupyter notebook ('B'), Python script ('C'), or the Python command line ('D') for the following purposes?
To provide your answers, create variables below and assign to them a string with each answer. For instance:
question_1_part_6 = 'C' # this would indicate you've chosen a Python script for part (6) of the question
Scenarios:
# Provide your answers below:
Although the goal of OCEAN 215 is not to teach "data science" per se, we hope this course will give you a window into the possibilities offered by data analysis. If you find that you enjoy working with data and writing computer code, you might want to consider a career in academic research or data science. But what is data science, precisely? To better understand this rapidly-growing field, we would like you to read the following article (which should take about 15 minutes), then answer the questions below. There are no wrong answers, but please respond thoughtfully with 1-3 sentences for each (2 points each).
Article: "What is data science?" Thinkful. https://www.thinkful.com/blog/what-is-data-science/. Read up until the "Your Turn" section.
What data do you think you have generated today that might, at some point, be analyzed by a data scientist?
Can you think of an oceanography data set that could be considered "big data"? Why would it be classified as such?
The article describes how machine learning can be useful for making predictions and identifying relationships in data. Can you think of a research question or task in oceanography or your academic area of interest that might benefit from machine learning? Why might machine learning be particularly useful for that task?
Provide your responses here:
Look at the following table of monthly weather averages in Honolulu, HI. (Source: Google)
\
Month | High T (˚F) | Low T (˚F) |
---|---|---|
January | 81 | 65 |
February | 81 | 65 |
March | 82 | 67 |
April | 83 | 68 |
May | 85 | 69 |
June | 86 | 72 |
July | 87 | 73 |
August | 88 | 73 |
September | 88 | 73 |
October | 87 | 72 |
November | 84 | 70 |
December | 82 | 67 |
\
Answer the following questions using Python code. Please have your code output (print) something for each part. This output should start with the question part – e.g. print('Part 1:',...) – and include the specified lists or calculations and the answers to questions.
# Provide your answers below:
Challenger Deep, at a depth of 36,200 feet, is known as the deepest part of the ocean (source: NOAA Ocean Facts). Answer the following questions using Python code. Please have your code output (print) the solutions to each part, including the units! Start your output with the question part, e.g. print('Part 1:',...). If you do not know a conversion factor, feel free to look it up online, but acknowledge the source.
# Provide your answers below:
Ribosomal RNA (rRNA) gene sequencing is often used in phylogenetic research. Each sequence is made up of combinations of 4 different nucleobases, adenine (A), cytosine (C), guanine (G), and uracil (U). Every species of bacteria and archaea has a slightly-unique version of its 16s rRNA gene. This means that 16s rRNA can be targeted to classify what marine bacteria are present in a given parcel of seawater.
In the code box below, there are two strings containing different DNA gene sequences, which differ from RNA in that they contain thymine (T) nucleobases rather than uracil (U). Answer the following questions using Python. Please have your code output (print) the solutions to each, including any strings you are asked to create or slice. Start your output with the question part, e.g. print('Part 1:',...).
# Keep these starting lines of code – these may not be changed:
DNA1 = 'GGGGGGCAGCAGTGGGGAATATTGGGCAATGGACGAAAGTCTGACCCAGCCATGCCGCGTGTGTGAAGAAGGCTCTAGGGTTGTAATGCACTTTAAGTAGGGAGGAAAGGTTGTGTGTTAATAGCACATAGCTGTGACGTTACCTACAGAATAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCGCGTAGGCGGTTATTTAAGCTAGATGTGAAAGCCCAGGGCTCAACCTTGGAATTGCATTTAGAACTGGGTAGCTAGAGTACAAGAGAGGGTGGTGGAATTTCCAGTGTAGCGGTGAAATGCGTAGAGATTGGAAGGAACATCAGTGGCGAAGGCGGCCACCTGGATTGATACTGACGCTGAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCCAGTAGTCCT'
DNA2 = 'GGGGCGCAGCAGTGGGGAATATTGCACAATGGGCGAAAGCCTGATGCAGCCATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCGCTTTCAGTTGTGAGGAAAGGGGTGTAGTTAATAGCTACATCCTGTGACGTTAGCAACAGAAGAAGCACCGGCTAACTTCGTGCCAGCAGCCTCGGTAATACGAGGGGTGCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGTTCGTAGGCGGTTTGTTAAGCAAGATGTGAAAGCCCTGGGCTCAACCTGGGAACTGCATTTTGAACTGGCAAACTAGAGTACTGTAGAGGGTGGTGGAATTTCCAGTGTAGCGGTGAAATGCGTAGAGATTGGAAGGAACATCAGTGGCGAAGGCGGCCACCTGGACAGATACTGACGCTGAGGAACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTAGTAGTCTG'
# Provide your answers below:
In the code box below, there is a list of rorqual whales containing both their common name, their scientific name as a sub-list separated into genus and species, and their estimated populations (source: The International Whaling Commission). As before, begin your answers with the question part, e.g. print('Part 1:',...).
# Keep these starting lines of code – these may not be changed:
rorqual_whales = [['Blue whale',['Balaenoptera','musculus'],'8000'],
['Bryde\'s whale',['Balaenoptera','brydei'],'90000'],
['Common minke whale',['Balaenoptera','acutorostrata'],'200000'],
['Fin whale', ['Balaenoptera','physalus'],'90000'],
['Sei whale', ['Balaenoptera','borealis'],'50000'],
['Antarctic minke whale',['Balaenoptera', 'bonaerensis'],'510000']]
# Provide your answers below:
This is a coding puzzle. Write 4 or fewer lines of code (to follow the 3 lines provided) such that, after executing the code:
Here's the catch: the code you write may contain mathematical operations (for instance, z = x + y), but may not contain any numbers and may not add a variable to itself (for instance, z = x + x).
# Keep these starting lines of code – these may not be changed:
x = 4
y = 12
z = 'Hello'
# Write your own code below: