Examensarbete i matematik på grundnivå

	Matematik

KTH Matematik

Examensarbete i matematik på grundnivå med inriktning mot optimeringslära och systemteori

(kurskod SA114X, SA115X och SA120X 15hp)

Allmän information om examensarbetet på studentwebben

Specifik information för ämnesvalet matematik

Optimeringslära och systemteori

Optimeringslära och systemteori är ett tillämpat matematiskt ämne som omfattar teori, modeller och metoder för optimering samt systemteoretiska aspekter inom ämnen som biologi, maskinteknik, reglerteknik, robotik, signalbehandling, maskininlärning och AI.

Vårens projektarbeten i optimeringslära och systemteori 2026. Obs, listan uppdateras ännu (förväntas vara helt klar senast 1:a november)

För att välja ett projekt med oss, maila jankr@kth.se, och skriv vilka projekt ni helst vill jobba med (första val, andra val, och tredjeval). Vi försöker alltid ge er ert första val (det lyckas ofta). Deadline för att skicka in önskemål om projekt är December 1.

Flera av projekten relaterar till befintlig forskning inom avdelningen och det finns i Stockholmsområdet ledande industri och forskningsföretag inom dessa tillämpningsområden. Andra projekt behandlar grundläggande matematiska problem inom ämnet vilka kan tillämpas inom många områden.

Projekt som vi erbjuder VT26

Learning to Steer Strategic Agents with Fast Convergence

With the growing involvement of humans and strategic agents in societal systems, such as intelligent infrastructure, transportation networks, and smart grids, it is crucial to address the so-called Incentive Design (ID) problem. In ID, a system-level planner seeks to steer agents toward desired behaviors by issuing appropriate incentives (rewards or penalties) into their objective functions. To evaluate effectiveness of an incentive, however, the planner must wait for the agents to settle into a stable response. This naturally gives rise to a two-timescale dynamic: incentive update must evolve on a slower timescale than agents' behavioral responses, yet still fast enough to ensure the overall convergence of the ID process.

This project investigates this two-timescale interaction in the ID problem through three subtasks: (i) Apply two-timescale stochastic approximation theory to derive the conservative step-size rule for incentive updates and agents' strategy updates, respectively. (ii) Building on the conservative rates above, develop a reinforcement learning (RL) algorithm to adaptively adjust the planner's incentive-update step size. The goal will be to accelerate the system's convergence to the desired behavior without impacting the method's stability. (iii) Compare the regret of the RL-based update algorithm with that of the two-timescale stochastic approximation baseline, and evaluate the improvements.

Optimering av vattenkraftverk

Optimering används inom många områden. Speciellt är optimering viktigt för att planera drift av vattenkraftverk i en älv. Typiskt är modellerna som används linjära. Inom ramen för KEX-arbeten har ett par utvidgningar av den linjära modellen studerats för att se hur stor skillnaden blir från en linjär modell. Arbetet syftar till att studera andra utvidgningar av den linjära modellen på liknande sätt. Arbetet kommer att utföras i samverkan med avdelningen för elkraftteknik på KTH.

Rekimmenderade förkunskaper: SF1811/1861

Dynamical Incentive Design via Reinforcement Learning

With the rapid proliferation of digital technologies, incentive design has emerged as a key paradigm in economics, public policy, and engineering systems. Its primary goal is to shape the reward or cost structures faced by individuals (agents) so that the resulting Nash equilibria collectively improve overall system outcomes. This work investigates the problem of incentive design in dynamic multi-agent games. The objective is to learn incentive policies through reinforcement learning, based on the observed dynamics of agents' trajectories, to guide their Nash equilibrium toward the desired optimal outcomes. The newly developed data-driven incentive mechanisms will be compared with traditional model-based incentive design methods and will be validated in concrete application models such as smart grids and intelligent transportation systems.

Analys av neurala nätverk med mixed-integer programming

Neurala Nätverk (NN) är känsliga mot attacker. Små (svårupptäckta) förändringar i in-data kan leda till stora (katastrofala) förändringar i resultat. På senare år har intresset för att kunna garantera att ett NN är robust ökat. Ett tränat NN can modelleras matematiskt genom att använda binära variabler. Vi kan på så sätt formulera optimerings-problem så som: givet en liten förändring i in-data: maximera värsta möjliga utfall.
I detta projekt kommer vi att träna ett NN, formulera den matematiska modellen, och lösa den över vårat NN. Implementation kommer att ske i Python.

Beroende to studentens förkunskaper/intresse finns möjlighet att något styra projektets fokus. Antingen titta närmre på NN (implementera olika NN) eller på optimerings-aspekten (olika sätt att formulera den matematiska modellen). Studenten rekommenderas därför att antingen 1) vara bekväm med Python och/eller ha arbetat med verktyg så som PyTorch, eller 2) ha läst en grundkurs i optimering (t.ex., SF1811 eller SF1861).

Image morphing via optimal transport

The optimal transport (OT) problem seeks the most efficient way to move mass from one distribution to another. In this project, OT will be applied to image interpolation, where one image is gradually transformed into another. By representing images as probability distributions, the Wasserstein distance can be used to compute intermediate images, known as Wasserstein barycenters, that form a smooth transition between the originals. The project will involve implementing the image morphing problem as a linear program, and test also approximation techniques such as the Sinkhorn algorithm to generate a sequence of interpolated images with optimal transport.

Physics-Informed Neural Networks for Optimal Path Planning of Unicycle Robots

Path planning for unicycle robots is typically addressed by classical algorithms that separate planning from control. This project explores an alternative paradigm using Physics-Informed Neural Networks (PINNs) to generate optimal, continuous trajectories end-to-end. The PINN is designed to output robot states and control inputs, constrained by a loss function that encodes the unicycle kinematics, start-goal states, and a minimal control effort objective. The model will be implemented and tested in simulation.

Multiple Target Tracking via Graph-Structured Multimarginal Optimal Transport

The field of Multiple Target Tracking (MTT) deals with the task of estimating targets which appear, disappear, and move within a scene, given data from noisy measurements. An important step that needs to be performed in an MTT algorithm is that of data association, that is, creating hypotheses of how the targets are moving given the detections. This step is further complicated by the presence of erroneous detections, which need to be handled as false positives, and the fact that computations need to be done efficiently in an online setting. The idea behind this project is to model the problem within the framework of graph-structured multimarginal optimal transport, and then to solve it using techniques found in literature.

Robust portföljoptimering

I portföljoptimering är målet att välja investering på ett så bra sätt som möjligt för det givna måttet. Oftast används förväntad avkastning och varians för att modellera detta. Problemet är att skattningen av dessa kan ha stor osäkerhet. Robust portföljoptimering hanterar detta genom att hitta den portfölj som är bäst för alla eller en viss andel av rimliga utfall. Projektet skulle bestå av att förstå hur man formulerar och löser problem i robust portföljoptimering, och jämföra det mot klassisk portföljoptimering. En kort introduktion till ämnet kan läsas här: https://docs.mosek.com/portfolio-cookbook/robustopt.html

Schemaläggning av tentor

Schemaläggning är ett typ av problem där heltalsoptimering kan användas för att formulera och lösa problemet. Till exempel i artikeln ''Final Exam Scheduling at Bucknell University: A Case Study and Open-Source Tool'' av Chaplin et al. (2025) utvecklade författarna en modell för att schemalägga tentor som tar hänsyn till studenternas preferenser. Förslagsvis skulle projektet gå ut på att förstå deras modell, implementera den i något modelleringsspråk och se hur väl det går att lösa. Man kan då undersöka vad som går att göra för att förbättra lösningstiden eller lösningskvalitén. Det finns många liknande artiklar om schemaläggning, så andra typer av schemaläggningsproblem går också att undersöka.

On the simplex method and the interior-point method for linear programming

A classical method for solving linear programming problems is the simplex method. This method has the drawback that there is (yet) no variant known that does not admit exponential running time in the worst case even if it usually runs in polynomial time. An efficient competitor for the simplex method, whose generalization is also used for non-linear programming, is the interior-point method, with a proven (weakly-)polynomial worst-case running time. Both methods have their strengths, and they generate their solution in an inherently different way, with the simplex method traveling through potential optimal solutions and the interior-point method aggregating the whole global picture and traveling through the interior of the feasible set.

In this project, you will study both methods and compare their behavior from both a theoretical and a practical point of view using your own implementation of the two methods in a suitable programming language. Suggested directions are to study the 'crossover' between the simplex method and the interior-point method to get a vertex solution for the latter as well, to explore different variations of the method (pivoting rules and primal versus primal-dual) on a dataset of test problems or to look at pathological examples for one method and see if the other method can handle this better. It is strongly advised to have taken a basic course in optimization (e.g. SF1811 or SF1861).

Optimering av lagerhållningssystem för varor med bäst före datum

Om man vill hålla nere kostnaderna för lagerhållning och minska svinnet så krävs planering vid beställning av nya varor och en god kännedom om efterfrågan. Vi vill studera en enkel modell och se på effekterna av kundbeteende och prissättning.

Dynamic optimization with truncation error estimation

Dynamic optimization is key in industrial manufacturing for off-line tasks (e.g., transitions between operating conditions, design of control systems) and on-line tasks, particularly in nonlinear model predictive control (NMPC). Automated tools are available to directly transcribe dynamic optimization problems into algebraic models (e.g., Pyomo.DAE), enabling efficient use of nonlinear optimization algorithms for fast and reliable problem solving. However, the selection of parameters such as the discretization scheme or the number of finite elements still relies on user experimentation. For instance, software such as Pyomo.DAE does not provide any guidelines or measures related to discretization accuracy (i.e., truncation error). From the user perspective, the ideal dynamic optimization software should be able to automatically select discretization parameters, or at least return measures related to the discretization error. Furthermore, the rule of thumb of adding as many finite elements as possible is not always feasible in practice; for example, using as few finite elements as possible while returning accurate solutions is desired in on-line tools such as NMPC.

This project aims to study the aforementioned issues by 1) proposing code able to provide accuracy measures for the solution of a dynamic optimization problem using principles from Richardson extrapolation and 2) based on these measures, proposing an algorithm or guidelines to choose the number of finite elements needed for the discretization.

Formation control of autonomous robots

In this project we aim to design control strategies for a group of autonomous robots (moving in a two-dimensional plane) so as to achieve formations of interest. The robots are subject to nonholonomic constraints, and therefore their dynamics can be modeled as a unicycle. Unlike most formation control approaches that rely on absolute state feedback and tracking error with respect to a predefined formation, our focus is on using only local measurements and as little information about the formation as possible. We will use simulation to validate the effectiveness of the strategies and study the stability properties of the emergent formations.

Optimisation of a Drone fleet for a Dark Warehouse

Automation in factories and warehouses are becoming increasingly more common. This has even coined a new term called dark factories. In these factories and warehouses the light is constantly turned off for efficiency reasons made available by the lack of human personnel.
In this project we will deal with the operation of one such dark warehouse. More specifically the task will be to coordinate a fleet of robots that move, store and package goods entering and exiting the warehouse. The goal will be to model and implement an algorithm that optimise various performance metrics of said fleet. This could include everything from optimising the routes taken to minimise the energy consumption and maximise throughput as new goods come in. The direction taken will depend on the interest of the students and will be discussed with the supervisor at the start of the thesis.

Projekten skall normalt genomföras i grupper om två studenter men det är även möjligt att arbeta individuellt. Vi kan också hjälpa er att hitta nån att jobba med. I en del projekt kan det finnas flera (2 eller 3) delprojekt. Samtliga projekt har en inläsningsdel och en problemlösningsdel. Inläsningsdelen är gemensam för alla grupperna i varje projekt medan problemlösningsdelen skall utföras självständigt inom de olika delprojekten.

Inläsningsdel

Projekten inleds med en inläsningsdel under LP3. Inläsningen av ämnet sker i form av en informell lärarledd studiecirkel, där deltagarna hjälps åt att lära sig den nödvändiga teorin. Denna delen av kursen avslutas med att studenterna i varje delprojekt presenterar sitt delproblem och sin arbetsplan för projektet. Detaljerna kring upplägget varierar lite grand mellan de olika projekten.

Problemlösningsdel

Problemlösningen utförs i huvudsak under LP4. Här skall grupperna självständigt arbeta med sina problem. Normalt träffas gruppen och lärarna en gång per vecka för att diskutera projektens status.

Kontaktpersoner

För frågor angående inriktningen mot optimeringslära och systemteori: Jan Kronqvist

Ungefärlig tidsplan för projektarbetet

Januari: Arbetetet påbörjas med inläsning.
Början av mars: Projektformuleringar och arbetsplan ska finnas färdiga.
Mitten/slutet av mars: Studenten lämnar disposition och skelett till handledaren.
Början av maj: Rapport lämnas till handledaren för granskning.
Mitten/slutet av maj: Redovisning, plagiatgranskning och betygssättning.


	Sidansvarig: Xiaoming Hu