Machine Learning for Agriculture and Natural Sciences
Explore the fundamentals of supervised learning algorithms, especially for agricultural and natural science datasets. Implement and interpret Machine Learning models, starting with statistical models. Learn how to choose and divide your datasets (testing and training). Select appropriate variables.
Then brush up on your Python and move on to Deep Learning. Learn about the XOR problem, back-propagation, and regularization. Learn what convolution is, and use it to classify images. Learn about U-Net and ResNet, and their applications in machine vision.
Syllabus and other information
Syllabus
P000137 Machine Learning for Agriculture and Natural Sciences, 7.5 Credits
Subjects
Education cycle
Postgraduate levelGrading scale
Language
EnglishPrior knowledge
Admitted to a postgraduate program in animal science, biology, veterinary medicine, food science, nutrition, nursing, or related subjects, or to a residency program in veterinary science. Alternatively, admitted to a postgraduate program in Bioinformatics or Biological field with a significant computational aspect. VMAS course Statistics I: Basic Statistics or similar (including regression). Introduction to Python for data science, PVG0045, or equivalent Python coursework or experience. If you want clarification on what constitutes sufficient Python knowledge or advice on how to obtain it, please contact the course coordinator.Objectives
On completion of the course, the student should be able to:
- Understand basics of statistical machine learning, including variable selection, cross-validation, and supervised and unsupervised learning.
- *Understand basic concepts of Deep Learning and its historical development, including perceptrons, back-propagation, and under/over training. *
- *Understand and select input/output encoding, activation function, as well as the types of layers. *
- Solve problems in binary and multi-class classification, especially in image analysis.
- *Apply the below listed concepts to important problems in Natural Sciences *
Content
Supervised Learning Applications in Agriculture and Natural Sciences. Explore the fundamentals of supervised learning algorithms tailored for agricultural andnatural science datasets. Implement and interpret models such as logistic regression, linear regression, regularization and shrinkage techniques, non-linear regression, discriminant analysis, regression trees, random forests, Gradient Boosting, and support vector machines.
Cross Validation Techniques: Understand the importance and application of cross-validation methods to assess model performance and generalization ability. Effectively implement and interpret cross-validation techniques to optimize model parameters and mitigate overfitting.
Variable Selection in High-Dimensional Data: Explore strategies for variable selection in machine learning models, particularly in the context of high-dimensional datasets common in agriculture and natural sciences. Identifying and prioritize relevant features to enhance model accuracy and interpretability.
Unsupervised Learning Techniques: Investigate unsupervised learning methods for extracting meaningful insights from unlabelled data in agriculture and natural sciences. Understand and apply principal component analysis (PCA) for dimensionality reduction and visualization, as well as various clustering algorithms including K-means, K-Medoids, DBSCAN, and Clara
Review of Python in preparation for Deep Learning portion of course.
The XOR problem, and its relation to hidden layers. Back-propagation: how it works, and how it relates to your choice of activation function.
Regularization, Frameworks, Regression, Convolutional layers and their role in image classification.
UNet and ResNet.
In person:
Course days will begin with a theory discussion (1-2 hours or as needed). The rest of the day will consist of laboratory exercises.
Labs early in the week will cover specific concepts, while labs later in the week will be directed towards a practical problems, e.g. gene selection, image classification. For the lab exercises you will turn in code and answer a quiz. Evaluation will be based on conceptual understanding and appropriate of application of the learned techniques. There will be time scheduled on the last in-person day to kick off the post-course assignment.
As a post-course assignment, the student may complete a novel task, e.g. in image classification. A specific problem will be suggested, or alternatively come from the student’s research. A report and code will be turned in.
Grading of this will be based on fulfilment of objectives, e.g. use of specific techniques, as well as appropriateness of the approach. While some degree of predictive accuracy is desired, demonstration of conceptual knowledge is more important.
Formats and requirements for examination
Examination will be primarily by assignments to be turned in. In these assignments, the student will implement methods, turn in code, and critically analyze results. The assignments will be in written or quiz form. Evaluation will be based on theoretical understanding, technical performance, and ability to interpret results. There will also be quiz questions testing theoretical knowledge.Additional information
Discussions will be in person. At-home preparation includes watching lecture videos and reading from books and articles.Laboratory exercises are started in class with guidance from instructor. However they may be continued at home if needed and turned in online.
Class meetings are in Ultuna. Occasionally the instructor and/or students may participate via Zoom, but this is not the norm.
Responsible department
Department of Animal Biosciences