Chapter 1 Welcome

Welcome to the Mathematical Marine Ecology Lab.

This book has been put together as the main source of information for all members of the MME Lab. It includes info on lab meetings, administration, data-storage, open data-science, R programming, writing skills and GitHub.

1.1 Lab Meetings

The MME lab meets up every week on Level 5 of the Queenland Bioscience Precinct. We generally use this as an opportunity to catch up on what everyone is doing, and share tips and tricks around data, programming and open data-science. Each meeting, one person is nominated to lead a discussion on a specific topic. Contact Anthony to be added to the email list. You can also attend virtually.

1.2 General Administration

1.2.1 Lab Contact:

For any lab-specific issues, you should contact Anthony in the first instance: Professory Anthony Richardson: ajr at maths.uq.edu.au

1.2.2 Lab Communication

1.2.2.1 Email

For formal communication that must be received, read and stored.

1.2.2.2 GitHub Issues Tracking

As you will see below, the MME Lab is using GitHub for project tracking. So sign up and join MME (Send Jason an email to request access to MME)

For specific questions and issue tracking that relates to a project. Each issue can have its own converstaion chain and we can track the progress of each question. Tag people to notify them of something they need to deal with.

1.2.2.3 Gitter

For idle chit-chat, tips/tricks, cool papers etc. Things that are not critical for people to see in a short timeframe. Just another messanger service, but one which is tied to GitHub. Sign up using your Github account. https://gitter.im

1.2.3 School of Maths and Physics

You can find all the information you need about SMP and relevent contacts, forms etc at the SMP website. https://smp.uq.edu.au

1.2.4 General IT Support

All general UQ IT-related information can be found at: https://my.uq.edu.au/information-and-services/information-technology

1.2.5 Student Support

The University of Queensland Student Support Services can be found at: https://www.uq.edu.au/student-services/

This includes information and support services for student welfare, learning support, faith, health etc. All students are encouraged to familiarise themseleves with this page.

1.3 Computing

Our lab are using the UQ Research Data Management (https://rdm.uq.edu.au/) to store data and write model output, and the UQ High Performace Computer “Awoonga”, “Flashlite” and “Tinaroo” for computing tasks. More information on each is at https://rcc.uq.edu.au/node/48.

If you have specific questions that can’t be answered by your lab colleagues, you can search the Research Computing Centre website (https://rcc.uq.edu.au).

1.3.1 Research Data Manager

Research Data Manager (RDM) is an integrated data management system covering the entire research data lifecycle.

The RDM will ensure that research data:

  • remains accurate, authentic, reliable, and complete,
  • retains integrity and improves the reproducibility of research,
  • is secure, minimising the risk of data loss,
  • can be reused, enabled by collecting critical metadata early,
  • is available in accordance with the Australian Code for the Responsible Conduct of Research and the FAIR guiding principles, and meets funding, journal and regulatory body requirements.

We have two collections on the RDM.

  1. MathMarEcol Data Storage (MME1DATA) which is where we store all our Read-Only datasets such as CMIP model outputs, satellite data, fisheries data etc. This is where you should store all the data sets you will be reading into your analysis. Please consult with Ant about your requirements and for access.

  2. MathMarEcol Working Projects (MME2WORK) is where we store all our dynamic output data (model outputs, figures etc). When you get access to this collection you should create a folder for your project (We suggest aligning this to the GitHub Repo name to avoid any confusion).

1.4 Useful Tools

1.4.1 Software

Some software you may find useful:

  • R:
  • RStudio: The best IDE for R.
  • Endnote: The UQs main reference manager software
  • Notepad++: A good text editor for Windows which deals with invisible characters, line endings etc much better than the default Notepad.
  • Sublime: A powerful text editor for MacOS.
  • GitHub Desktop: A simple Github manager for those repos you need to manage outside of RStudio.
  • FileZilla: A great SFTP client for MacOS/Windows/Linux for moving files around the university.

Others?

1.4.2 R Resources

1.4.2.1 Books

R for Data Science (Otherwise known as R4DS) by Hadley Wickham and Garrett Grolemund. This book will teach you how to do data science with R: You’ll learn how to get your data into R, get it into the most useful structure, transform it, visualise it and model it.
https://r4ds.had.co.nz

Advanced R by Hadley Wickham. This book is designed primarily for R users who want to improve their programming skills and understanding of the language
http://adv-r.had.co.nz

R Graphics Cookbook by Winston Chang. This book is a practical guide that provides more than 150 recipes to help you generate high-quality graphs quickly, without having to comb through all the details of R’s graphing systems.
https://r-graphics.org

Efficient R Programming by Colin Gillespie and Robin Lovelace. This hands-on book teaches novices and experienced R users how to write efficient R code. https://csgillespie.github.io/efficientR/

R Packages by Hadley Wickham. The goal of this book is to teach you how to develop packages so that you can write your own, not just use other people’s.
http://r-pkgs.had.co.nz

Text Mining with R by Julia Silge and David Robinson. This book serves as an introduction of text mining using the tidytext package and other tidy tools in R.
https://www.tidytextmining.com

Bookdown by Yihui Xie. This book introduces an R package, bookdown, to change your workflow of writing books, websites or even theses. It was how we put together this website.
https://bookdown.org/yihui/bookdown/

1.4.2.2 Online Resources

rOpenSci Community Calls - Live presentations and recordings which allow you to hear about best practices, new projects, Q&As with well known developers, and to learn more about rOpenSci developments.
https://ropensci.org/commcalls/

1.4.3 R Packages

The Tidyverse
https://www.tidyverse.org
The tidyverse is an opinionated collection of R packages designed for data science. All packages share an underlying design philosophy, grammar, and data structures.

Patchwork https://patchwork.data-imaginist.com The goal of patchwork is to make it ridiculously simple to combine separate ggplots into the same graphic. As such it tries to solve the same problem as gridExtra::grid.arrange() and cowplot::plot_grid but using an API that incites exploration and iteration, and scales to arbitrily complex layouts.

Stars Package
https://r-spatial.github.io/stars/
STARS is good for non-regular grid, which is what the hi-res oceanographic models generally are these days.

ClimateOperators package
https://github.com/markpayneatwork/ClimateOperators
Useful for being able to run CDO in R.

TidyText Package
https://github.com/juliasilge/tidytext
Text mining using dplyr, ggplot2, and other tidy tools

rOpenSci Packages
https://ropensci.org/packages/
rOpenSci, which fosters a culture of open and reproducible research using shared data and reusable software, has collated a range of useful packages, including rNOAA, taxize, BOMrang and scrubr.

Paleteer
https://emilhvitfeldt.github.io/paletteer/
The goal of paletteer is to be a comprehensive collection of color palettes in R using a common interface. This package includes 1771 from 50 different packages

Simple Features Package (sf)
https://r-spatial.github.io/sf/articles/sf1.html
Simple features refers to a formal standard (ISO 19125-1:2004) that describes how objects in the real world can be represented in computers, with emphasis on the spatial geometry of these objects. These are very easy to use with ggplot and are great for plotting maps and analysing spatial data in R.

1.5 UN Sustainable Development Goals

The UN Sustainable Development Goals are a valuable resource for everyone to provide the global context for their research.

The Sustainable Development Goals (SDGs) are a set of seventeen interconnected “Global Goals” that United Nations members have committed to achieving by 2030 to help deliver sustainable prosperity. These have replaced the Millennium Development Goals (MDG) and are shaping the future of development globally. There is broad agreement that the MDGs provided a focal point for government policy development and aid programmes, but they failed to consider the root causes of issues around poverty, gender inequality, the interconnected nature of drivers and impacts of development. The intent is to transform human activity so that it directly addresses and reduces armed conflict, poverty, disease, congestion, environmental degradation and climate change while also improving equality, education and quality of life.

This is intentionally aspirational, with the vision and goals derived from the largest consultation programme in UN history (involving representatives from 70 countries, 11 thematic and 83 national consultations and door-to-door and an online My World surveys). Governments around the world have accepted the goals with countries like Denmark and Colombia building the SDGs into the foundation of their national strategic prosperity plans and policies across all departmental areas. This means the SDGs are fundamental to all aspects of regulatory, government and daily activities and decision making.

Globally, the private sector is seen to have a major role in the implementation of the SDGs. The International Chamber of Commerce (ICC) has explicitly stated that sustainability is no longer a luxury business investment, but a core driver of business productivity and growth. Consequently, the ICC developed the Business Charter for Sustainable Development to help companies contribute to the SDG implementation, as responsible business is the new normal “At the heart of economic growth are innovative, successful, and responsible businesses operating within strong, forward-looking governance and policy frameworks”

(https://iccwbo.org/publication/icc-business-charter-for-sustainable-development-business-contributions-to-the-un-sustainable-development-goals/).

Thanks to Dr Beth Fulton from CSIRO for the text.

1.6 Open Data Science

While we have always made our data and code available to those who have asked, in 2019 the MME Lab has decided to adopt a full open data-science model. We attended the wonderful Data Science Training (http://ohi-science.org/data-science-training/) run by the The Ocean Health Index Team. We acknowledge all the help and resources they have made available which has allowed us to get up and running so quickly.

Many sections of this book introduce open data science so we can work with data in an open, reproducible, and collaborative way. Open data science means that methods, data, and code are available so that others can access, reuse, and build from it without much fuss. Here we demonstrate a workflow with R, RStudio, Git, and GitHub, as described in Lowndes et al. 2017, Nature Ecology & Evolution: Our path to better science in less time using open data science tools.

1.7 Online Code of Conduct

We have adopted the code of conduct from Mozilla to govern participation in all online communities associated with the MME Lab, including GitHub, Gitter, GoogleDocs, email and lab meetings. As part of this we must always:

  • Be respectful and value each other’s ideas, styles and viewpoints.
  • Be direct but professional; we cannot withhold hard truths.
  • Be inclusive and help new perspectives be heard.
  • Be open to learning from others.
  • Lead by example and match your actions with your words.

For a full description of the Community Participation Guidelines see: https://www.mozilla.org/en-US/about/governance/policies/participation/