Computing and Mathematical Sciences Papers
Permanent URI for this collectionhttps://researchcommons.waikato.ac.nz/handle/10289/6
This collection houses research from the School of Computing and Mathematical Sciences at the University of Waikato.
Browse
Recent Submissions
Item type: Item , CodeWars: Using LLMs for vulnerability analysis in cybersecurity education(The Colloquium for Information Systems Security Education, 2026-03-21) Chaudhary, Arunima; Colombo, Gualtiero; Javed, Amir; Haseeb, Junaid; Kumar, Vimal; Larsen, RichardLarge Language Models (LLMs) are increasingly explored as tools for software development and could further constitute a supplementary source for the development of varied examples intended for pedagogical use. While they can improve productivity, their ability to produce code that is both secure and compliant with Secure Software Development (SSD) practices remains uncertain, raising concerns about their role in cybersecurity education. If LLMs are to be integrated effectively, students must be trained to critically evaluate generated code for correctness and vulnerabilities, raising an important question: How can LLM-generated code be effectively and securely incorporated into Cybersecurity education for teaching vulnerability analysis? This paper introduces CodeWars, a novel teaching methodology that combines LLM-generated and human-written code to examine how students engage with vulnerability detection tasks. CodeWars was implemented as a pilot study with a total of 32 students at Cardiff University and the University of Waikato, where students analyzed flawed, secure, and mixed-origin code samples. By comparing student approaches, analysis, and perceptions, the study provides insights into how vulnerabilities are detected, how code origins are distinguished, and how SSD practices are applied. Our analysis of student feedback and interviews indicates that Codewars produced structured and accessible code, simplifying vulnerability identification and offering educators the means to efficiently develop varied SSD teaching applications. These findings illuminate both the advantages and constraints of employing LLMs in secure coding and position this study as a foundational step toward the responsible adoption of AI in Cybersecurity Education.Item type: Item , Empowering education: Student perceptions and attitudes to the role of social robots in learning contexts(ACM, 2026-03-16) Turner, Jessica Dawn; Vanderschantz, Nicholas; Bowen, Judy; König, Jemma Lynette; Carino, HannahSuccessful integration of social robots in education relies on the acceptance of robots in learning contexts by students. Using a participatory design workshop, students interacted with a KettyBot and ideated potential roles for robots in the classroom. This was followed by a questionnaire and the Godspeed Questionnaire Series (GQS) to understand student perceptions and attitudes towards social robots in education environments. Learners described potential use cases and our results demonstrate students envision robots as assistants rather than teachers, emphasising the importance of human connection in learning.Item type: Item , “It almost wanted to hurt someone”: The impact of intentional creepiness on user perceptions(ACM, 2026-03-16) Turner, Jessica Dawn; Vanderschantz, Nicholas; König, Jemma Lynette; Siddika, RafeeaThe intentional design of robots to evoke creepiness provides a unique lens for studying human perception and willingness to engage. To understand user perceptions and acceptance of robots we developed a robot prototype designed with targeted facial, morphological, and movement features that may be perceived as "creepy". Using the Human-Robot Interaction Evaluation Scale (HRIES) we found that disturbance was moderate towards our intentionally creepy robot with significant participant variation. Furthermore, qualitative results confirmed this polarity, with descriptions ranging from "angry and unfriendly" to "cool and cute". This variability demonstrates that "creepiness" is more subjective than initially anticipated and highlights a key research gap in academic literature with the need for measurement tools which capture negative perceptions in HRI.Item type: Item , Accuracy of machine learning models versus "hand crafted" expert systems – A credit scoring case study(Elsevier, 2009) Ben-David, Arie; Frank, EibeRelatively few publications compare machine learning models with expert systems when applied to the same problem domain. Most publications emphasize those cases where the former beat the latter. Is it a realistic picture of the state of the art? Some other findings are presented here. The accuracy of a real world “mind crafted” credit scoring expert system is compared with dozens of machine learning models. The results show that while some machine learning models can surpass the expert system’s accuracy with statistical significance, most models do not. More interestingly, this happened only when the problem was treated as regression. In contrast, no machine learning model showed any statistically significant advantage over the expert system’s accuracy when the same problem was treated as classification. Since the true nature of the class data was ordinal, the latter is the more appropriate setting. It is also shown that the answer to the question is highly dependent on the meter that is being used to define accuracy.Item type: Item , Gene selection from microarray data for cancer classification - A machine learning approach(Elsevier, 2005) Wang, Yu; Tetko, Igor V.; Hall, Mark A.; Frank, Eibe; Facius, Axel; Mayer, Klaus F.X.; Mewes, Hans W.A DNA microarray can track the expression levels of thousands of genes simultaneously. Previous research has demonstrated that this technology can be useful in the classification of cancers. Cancer microarray data normally contains a small number of samples which have a large number of gene expression levels as features. To select relevant genes involved in different types of cancer remains a challenge. In order to extract useful gene information from cancer microarray data and reduce dimensionality, feature selection algorithms were systematically investigated in this study. Using a correlation-based feature selector combined with machine learning algorithms such as decision trees, nave Bayes and support vector machines, we show that classification performance at least as good as published results can be obtained on acute leukemia and diffuse large B-cell lymphoma microarray data sets. We also demonstrate that a combined use of different classification and feature selection approaches makes it possible to select relevant genes with high confidence. This is also the first paper which discusses both computational and biological evidence for the involvement of zyxin in leukaemogenesis.Item type: Item , 10-year survival comparison of two cemented implants in primary total hip arthroplasty for osteoarthritis: A New Zealand regional study(Springer, 2025) Pearce, Amy; Joshi, Chaitanya; Chan, Georgina; Lamberton, Tony; MacLean, Simon; Vane, Andrew; Hébert-Losier, KimIntroduction Compare 10-year survival of the cemented highly crosslinked polyethylene Exeter® Rimfit™ (Rimfit) Cup and its predecessor, the ultra-high molecular weight polyethylene Exeter® Contemporary Flanged Cup™ (ECF), both with an Exeter® V40™ stem, in primary total hip arthroplasty (THA) for osteoarthritis in the Bay of Plenty region of NZ. Method We extracted national registry data for THA surgeries in the region between 1 January 2003 and 30 June 2023 and report the 10-year survival and reasons for revision of the two fully cemented implants (n = 495). We compared standard Kaplan-Meier estimates using the log-rank test. Cox proportional hazard models investigated the potential influence of six patient variables on the survival of each implant: sex, age, body mass index (BMI), ethnicity, American Society of Anesthesiologists (ASA) rating, and funding source (public/private). Results No statistically significant difference in 10-year survival rate between the implants (p = 0.334) (ECF 95.6% [93.4, 97.9], Rimfit 97.0% [95.9, 98.2]) or statistically significant difference in revision reasons between the implants (p = 0.09) was noted. Cox regression revealed no statistically significant influence of any of the six patient variables on the 10-year survival of the ECF (p = 0.584) or Rimfit (p = 0.611). Conclusion Both implants exceeded 95% survival at 10-years, which is favourable compared to the corresponding 94.8% national survivorship of cemented implants in NZ. There is no statistically significant difference in the 10-year survival rate or reasons for revision of the two cemented implants compared in this region. The Rimfit appears a suitable alternative to the ECF, from a survival and revision perspective.Item type: Item , 15-year patient-reported outcomes of a cemented flanged cup and stem combination in primary total hip arthroplasty: A New Zealand study(SAGE, 2026) Pearce, Amy; Joshi, Chaitanya; Chan, Georgina; Lamberton, Tony; MacLean, Simon; Vane, Andrew; Hébert-Losier, KimMethods: We investigated 15-year patient-reported outcomes (PROMs) and their predictors in primary total hip arthroplasty (THA) for osteoarthritis using a cemented flanged cup and stem from a regional joint registry in New Zealand. Regional data were collected for all primary THAs with this cemented combination from 1 January 2003 to 30 June 2023 who had recorded PROMs on at least 1 occasion (n = 263). PROMs included Oxford Hip Score, Western Ontario and McMaster Universities Arthritis Index and Veterans Rand-12, evaluated against patient age, ethnicity, sex, body mass index (BMI), funding pathway, and American Society of Anesthesiologists (ASA) rating. Results: Significant improvements across preoperative PROMs were noted 1-year post-surgery, with a mean change above 23 in the Oxford Hip Score maintained at 5, 10, and 15 years (p ⩽ 0.001). Conclusions: Regression analysis indicated that being female, public funding, and higher BMI were associated with worse preoperative PROMs. Poorer preoperative scores, older age and ASA 3 rating correlated with poorer postoperative outcomes.Item type: Publication , Computer graphic art of clothing(2019) Soo, Chin-En KeithWe are living in an ever-changing world, where new methods are being introduced to carry out the most staightforward task, where new inventions are being proposed to ease our daily operations, where new ideas are popping out at every corner. Since the dawn of time, mankind has been using innovation and creativity to survive and enhance life. With the use of technology, it has enabled more possibility and more significant endeavour. Now and forever, we are dependant on technology, that has played a substantial role in our design solutions, which inescapably affect every one of us.Item type: Publication , Type brighter(Domus Argenia, 2013) Soo, Chin-En Keith; Soddu, C; Colabella, EType Brighter is intended as a new way of reading the alphabet. Shape, colour and pattern create memorable sequences based on characteristics of the letterform. By utilizing colour and repetition, readability is promoted. Each letter of the English alphabet is assigned a colour and positions, resulting in a full set of unique patterns. The user types using the keyboard, and the corresponding lights are shown on the light board. The simplicity of colour makes Type Brighter an alternative to more complicated communications such as mores code, and the use of pattern creates memorable sequences of colour. User can also experience the change of ambience, while the moving colour type projects an abstract story using light. Type Brighter aims to create a new visual language through light. Colour, shape and pattern are strong visual elements, and when combined create a memorable experience. Colour serves an important part of our everyday lives and is easily distinguishable in all situations, making Type Brighter effective in a range of applications.Item type: Publication , RenderRing(Domus Argenia, 2015) Soo, Chin-En Keith; Soddu, Celestino; Colabella, EnricaRenderring is a musical platform for intuitive composition. It enables users’ interaction to provide opportunity for anyone to draw a unique circle and translates the drawing into a piece of melody. Users are able to set the composition variables before they start (Tempo, time signature and number of notes). The process involves two parts: First is a collection of user input by getting user to draw any unique circle in a provided space. Second is an interpretation using the program to decipher the drawing and identify point of intersections on the musical staff. After which, the program will produce a unique piece of melody with the user’s drawing. The user can then proceed with options of redoing or saving the melody. Renderring aims to bring new experience to create melody with a vision to simplify complexity. Transferring oneself energy from one form to another by converting visual to sound. The process enables creativity and empowers everyone to express his or her hidden inner potentials by making straightforward music.Item type: Publication , Hueue(Domus Argenia, 2017) Soo, Chin-En Keith; Soddu, Celestino; Colabella, EnricaHUEUE aims to capture the colour story of a movie and present it in an accessible time frame of a minute. Movies at their simplest are colour, sound and motion. HUEUE aims to distil any movie into these basic forms and generate a unique form of escapism, bring the audience on a journey into the movie itself. HUEUE creates a tunnel effect. The effect indicates an impression of a portal. This is intended to give life to the escapism and create a more concrete feeling of the journey with the aid of sequential colours flowing from the movie. The audio is condensed creating a pitch shift, simulating the Doppler effect. All these elements create an experience that accelerate the viewer into the escapism and further into the movie.Item type: Publication , Palate 字觉(Domus Argenia, 2018) Soo, Chin-En Keith; Soddu, Celestino; Colabella, EnricaChinese characters are a visual symbol with strong contagion. Palate uses the pronunciation and character structure of Chinese characters as the entry point to colorize the Chinese characters so as to give them the possibility of expressing colors in the design of Chinese characters. Each Chinese character has its own unique color system. The application of the colorisation of Chinese characters can help to study the artistic charm of Chinese characters from a new perspective, improve the visual impact of Chinese characters, break the limitations of the past in the search for changes in the design of Chinese characters, and seeking a new form of modern Chinese character design.Item type: Publication , ebBe(Domus Argenia, 2021) Soo, Chin-En Keith; Simmons, Rowan; Soddu, Celestino; Colabella, EnricaebBe is a visualisation of tweets in realtime. ebBe uses word frequency to express the notion and motion of an ebb by contracting lines representing creation and decay. The created lines in different qualities will mimic and manifest a live visual artwork.Item type: Item , Chromatic ghost(Domus Argenia, 2024) Soo, Chin-En Keith; Simmons, Rowan; Soddu, Celestino; Colabella, Enrica"Chromatic Ghost" delves into the convergence of artificial intelligence and the emotional narrative of cinema, exploring how palettes can AI-generated colour visually express the emotional depth of films. The project aims to assess the capabilities of AI in understanding and translating human emotions into visual representations, questioning how effectively machines, which are increasingly embedded in our "Chromatic Ghost" is a creative experiment designed to investigate how AI can interpret and visualise human emotion in the context of film. In an era where artificial intelligence is increasingly intertwined with our daily lives, this work examines how well these technologies can grasp and reflect the complexities of human emotions, which are often considered too nuanced or abstract for machines to comprehend. lives, can interpret the intricacies of our emotional experiences. By using AI to generate emotion- based colour palettes and applying them to film frames, "Chromatic Ghost" transforms iconic cinematic scenes into layered, ethereal images that evoke the emotional core of the films in a spectral, dreamlike manner. The resulting visual compositions—ghostly, fragmented, and nuanced—invite viewers to contemplate both films' emotional resonance and AI’s role as a mediator of this experience.Item type: Publication , Co-WARe(Domus Argenia, 2022) Soo, Chin-En Keith; Soddu, Celestino; Colabella, EnricaThroughout history, humans have been creating receptacles in their daily activities to hold, keep, and preserve the rewards and objects they treasure. Applying the same notion, Co-WARe makes unique receptacles from covid data to express the information in an artistic form. Co-WARe is an objectified presentation of all COVID-19 cases, deaths, geographically located data in each country, and the time of the data was generated. These data series project the different changes brought to each country since the beginning of COVID-19. It also provides more intuitive insight into the epidemic in all countries worldwide.Item type: Item , A natural behavior planner for multi-personal human-robot interaction within the simulated environment(Elsevier BV, 2026-03) Chen, Yue; Zheng, Pai; Zhou, Zhiyuan; Soo, Chin-En Keith; Wang, Haining; Yu, ChunyangIn recent years, diffusion models have made remarkable success in generating realistic human motions. However, existing robot pose-learning approaches are largely focused on single-task and one-to-one scenarios, failing to account for multi-person social interactions. This limitation leads to rigid, context-insensitive behaviors that are ill-suited for real-world service scenarios. Consequently, current systems often produce robotic behaviors incapable of the fluidity and responsiveness expected in human-centered environments, a shortcoming underscored by affordance theory in robotics. To address this issue, we propose RoboActor, an innovative human-robot interaction behavior planner that draws inspiration from theatrical acting to orchestrate both deliberate and automatic actions. Our framework leverages large language models (LLMs) to disentangle primary command-driven tasks from secondary, context-induced subtasks. By this means, RoboActor generates lifelike and socially appropriate behaviors in multi-person settings, significantly enhancing the naturalness, engagement, and realism of service robots in everyday social applications.Item type: Item , Parasthetica: Artworks for GA2025(Domus Argenia Publisher, 2025-12) Soo, Chin-En Keith; Simmons, Rowan; Soddu, Celestino; Colabella, EnricaParasthetica transforms eBooks into abstract visual artworks through a character-matching system that maps literary text into chromatic geometric forms. The program positions itself at the crossroads of digital humanities, generative art, and data visualisation. Each text becomes a living artwork, where words are translated into dynamic, colour-based grids that reveal hidden linguistic and structural patterns. Rather than statistical summaries, the visualisations offer immersive aesthetic experiences that mirror the rhythm, genre, and emotional intensity of the source material.Item type: Publication , Augmenting NIR spectra in deep regression to improve calibration(Elsevier, 2023) Wohlers, Mark W.; McGlone, Andrew; Frank, Eibe; Holmes, GeoffreyDeep learning, particularly with convolutional neural networks, shows promise in modelling near-infrared spectroscopy (NIRS), but the lack of robust generalisation across instruments often affects performance in practice. Here, we investigate a method to increase the robustness of this approach. The proposed method involves using a simple data augmentation technique during the training process. The performance of convolutional neural network regression is compared to partial least squares regression (PLSR) using kiwifruit data collected from multiple handheld devices over three seasons and mango data collected from a single device over four seasons. The results suggest that data augmentation for NIR spectra can prevent overfitting. In particular, augmenting the training data to mimic spectra collected over multiple devices results in a neural network model with improved performance over PLSR.Item type: Item , Barlow Twins for semi-supervised learning in NIR spectroscopy(Elsevier BV, 2026-02-15) Wohlers, Mark W. ; McGlone, Andrew; Frank, Eibe; Holmes, GeoffreyNear-infrared (NIR) spectroscopy is a widely used technology in the horticulture industry for non-destructive fruit grading. Partial Least Squares (PLS) regression is the dominant method for producing fruit quality predictions from measured spectra. Alternative deep learning methods have shown promise, but often require large amounts of labelled data to train. This study proposes a semi-supervised method based on Barlow Twins to include unlabelled data in the training process. We adopt the Barlow Twins method by using repeated measurements on the same fruit from different devices as different “views” to encode into the same latent space and combine the encoder network with a regression head for prediction. Our approach demonstrates improved performance over PLS with up to 17% lower RMSE, especially when the labelled data is limited. The Barlow loss function also improves calibration transfer results.Item type: Item , Assessing machine learning models for near-infrared regression by measuring stability towards diffeomorphisms(Elsevier, 2025) Wohlers, Mark W.; McGlone, V.A.; Frank, Eibe; Holmes, GeoffreyNear infrared (NIR) spectroscopy is widely used as a tool for non-destructive assessment of fruit quality by applying measured spectra to predict quality parameters such as dry matter and soluble solids content using a suitable regression method. With continued advancements in deep learning, there is potential for improved predictive performance when neural network models are applied instead of partial least-squares regression, but choosing a model remains challenging as performance is sensitive to the model's architecture. Taking inspiration from work done in image classification, we propose model selection by assessing relative stability to diffeomorphic transformations, providing a complementary approach to standard validation methods. This is particularly useful when labelled validation data is limited. Our empirical results on several NIR regression problems indicate that the proposed approach is comparable to the use of independent validation sets. In addition to the choice of deep learning architecture, we also consider the selection of the number of components in partial least-squares regression to demonstrate the method's generality.