Showing 1–34 of 34 results for author: Mathewson, K

  1. arXiv:2405.20956  [pdf, other]

    cs.AI cs.CL

    A Robot Walks into a Bar: Can Language Models Serve as Creativity Support Tools for Comedy? An Evaluation of LLMs' Humour Alignment with Comedians

    Authors: Piotr Wojciech Mirowski, Juliette Love, Kory W. Mathewson, Shakir Mohamed

    Abstract: We interviewed twenty professional comedians who perform live shows in front of audiences and who use artificial intelligence in their artistic process as part of 3-hour workshops on "AI x Comedy" conducted at the Edinburgh Festival Fringe in August 2023 and online. The workshop consisted of a comedy writing session with large language models (LLMs), a human-computer interaction questionnaire to…

    Submitted 3 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: 15 pages, 1 figure, published at ACM FAccT 2024

  2. arXiv:2405.13012  [pdf]

    cs.CL cs.AI

    Divergent Creativity in Humans and Large Language Models

    Authors: Antoine Bellemare-Pepin, François Lespinasse, Philipp Thölke, Yann Harel, Kory Mathewson, Jay A. Olson, Yoshua Bengio, Karim Jerbi

    Abstract: The recent surge in the capabilities of Large Language Models (LLMs) has led to claims that they are approaching a level of creativity akin to human capabilities. This idea has sparked a blend of excitement and apprehension. However, a critical piece that has been missing in this discourse is a systematic evaluation of LLM creativity, particularly in comparison to human divergent thinking. To brid…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: First two and last listed authors are corresponding authors. The first two listed authors contributed equally to this work

  3. arXiv:2405.07111  [pdf, other]

    cs.CL

    Designing and Evaluating Dialogue LLMs for Co-Creative Improvised Theatre

    Authors: Boyd Branch, Piotr Mirowski, Kory Mathewson, Sophia Ppali, Alexandra Covaci

    Abstract: Social robotics researchers are increasingly interested in multi-party trained conversational agents. With a growing demand for real-world evaluations, our study presents Large Language Models (LLMs) deployed in a month-long live show at the Edinburgh Festival Fringe. This case study investigates human improvisers co-creating with conversational agents in a professional theatre setting. We explore…

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 13 pages, 7 figures, accepted for publication at the International Conference on Computational Creativity 2024

  4. arXiv:2211.01480  [pdf, other]

    cs.MA cs.CL cs.HC

    Over-communicate no more: Situated RL agents learn concise communication protocols

    Authors: Aleksandra Kalinowska, Elnaz Davoodi, Florian Strub, Kory W Mathewson, Ivana Kajic, Michael Bowling, Todd D Murphey, Patrick M Pilarski

    Abstract: While it is known that communication facilitates cooperation in multi-agent settings, it is unclear how to design artificial agents that can learn to effectively and efficiently communicate with each other. Much research on communication emergence uses reinforcement learning (RL) and explores unsituated communication in one-step referential tasks -- the tasks are not temporally interactive and lac…

    Submitted 2 November, 2022; originally announced November 2022.

  5. arXiv:2210.08085  [pdf, other]

    cs.AI q-bio.NC

    Adaptive patch foraging in deep reinforcement learning agents

    Authors: Nathan J. Wispinski, Andrew Butcher, Kory W. Mathewson, Craig S. Chapman, Matthew M. Botvinick, Patrick M. Pilarski

    Abstract: Patch foraging is one of the most heavily studied behavioral optimization challenges in biology. However, despite its importance to biological intelligence, this behavioral optimization problem is understudied in artificial intelligence research. Patch foraging is especially amenable to study given that it has a known optimal solution, which may be difficult to discover given current techniques in…

    Submitted 21 April, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR). See: https://openreview.net/pdf?id=a0T3nOP9sB

  6. arXiv:2209.14958  [pdf, other]

    cs.HC cs.CL

    Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals

    Authors: Piotr Mirowski, Kory W. Mathewson, Jaylen Pittman, Richard Evans

    Abstract: Language models are increasingly attracting interest from writers. However, such models lack long-range semantic coherence, limiting their usefulness for longform creative writing. We address this limitation by applying language models hierarchically, in a system we call Dramatron. By building structural context via prompt chaining, Dramatron can generate coherent scripts and screenplays complete…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: 102 pages, 7 figures

  7. arXiv:2207.06958   

    cs.SD cs.LG eess.AS

    Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

    Authors: Alice Baird, Panagiotis Tzirakis, Gauthier Gidel, Marco Jiralerspong, Eilif B. Muller, Kory Mathewson, Björn Schuller, Erik Cambria, Dacher Keltner, Alan Cowen

    Abstract: This is the Proceedings of the ICML Expressive Vocalization (ExVo) Competition. The ExVo competition focuses on understanding and generating vocal bursts: laughs, gasps, cries, and other non-verbal vocalizations that are central to emotional expression and communication. ExVo 2022, included three competition tracks using a large-scale dataset of 59,201 vocalizations from 1,702 speakers. The first,…

    Submitted 16 August, 2022; v1 submitted 14 July, 2022; originally announced July 2022.

  8. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  9. arXiv:2205.01780  [pdf, other]

    eess.AS cs.LG cs.SD

    The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

    Authors: Alice Baird, Panagiotis Tzirakis, Gauthier Gidel, Marco Jiralerspong, Eilif B. Muller, Kory Mathewson, Björn Schuller, Erik Cambria, Dacher Keltner, Alan Cowen

    Abstract: The ICML Expressive Vocalization (ExVo) Competition is focused on understanding and generating vocal bursts: laughs, gasps, cries, and other non-verbal vocalizations that are central to emotional expression and communication. ExVo 2022, includes three competition tracks using a large-scale dataset of 59,201 vocalizations from 1,702 speakers. The first, ExVo-MultiTask, requires participants to trai…

    Submitted 12 July, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

  10. arXiv:2204.09622  [pdf, other]

    cs.HC cs.GL cs.LG

    A Brief Guide to Designing and Evaluating Human-Centered Interactive Machine Learning

    Authors: Kory W. Mathewson, Patrick M. Pilarski

    Abstract: Interactive machine learning (IML) is a field of research that explores how to leverage both human and computational abilities in decision making systems. IML represents a collaboration between multiple complementary human and machine intelligent systems working as a team, each with their own unique abilities and limitations. This teamwork might mean that both systems take actions at the same time…

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 7 pages, 1 figure, Published at ML Evaluation Standards Workshop at ICLR 2022. arXiv admin note: substantial text overlap with arXiv:1905.06289

  11. arXiv:2203.00715  [pdf, other]

    cs.LG cs.AI cs.MA

    Learning Robust Real-Time Cultural Transmission without Human Data

    Authors: Cultural General Intelligence Team, Avishkar Bhoopchand, Bethanie Brownfield, Adrian Collister, Agustin Dal Lago, Ashley Edwards, Richard Everett, Alexandre Frechette, Yanko Gitahy Oliveira, Edward Hughes, Kory W. Mathewson, Piermaria Mendolicchio, Julia Pawar, Miruna Pislar, Alex Platonov, Evan Senter, Sukhdeep Singh, Alexander Zacherl, Lei M. Zhang

    Abstract: Cultural transmission is the domain-general social skill that allows agents to acquire and use information from each other in real-time with high fidelity and recall. In humans, it is the inheritance process that powers cumulative cultural evolution, expanding our skills, tools and knowledge across generations. We provide a method for generating zero-shot, high recall cultural transmission in arti…

    Submitted 1 March, 2022; originally announced March 2022.

  12. arXiv:2111.03146  [pdf, other]

    cs.LG cs.SD eess.AS

    Generating Diverse Realistic Laughter for Interactive Art

    Authors: M. Mehdi Afsar, Eric Park, Étienne Paquette, Gauthier Gidel, Kory W. Mathewson, Eilif Muller

    Abstract: We propose an interactive art project to make those rendered invisible by the COVID-19 crisis and its concomitant solitude reappear through the welcome melody of laughter, and connections created and explored through advanced laughter synthesis approaches. However, the unconditional generation of the diversity of human emotional responses in high-quality auditory synthesis remains an open problem,…

    Submitted 29 July, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: Presented at Machine Learning for Creativity and Design workshop at NeurIPS 2021, 6 pages

  13. arXiv:2111.02216  [pdf, other]

    cs.CL cs.LG cs.MM cs.SD eess.AS

    Automatic Embedding of Stories Into Collections of Independent Media

    Authors: Dylan R. Ashley, Vincent Herrmann, Zachary Friggstad, Kory W. Mathewson, Jürgen Schmidhuber

    Abstract: We look at how machine learning techniques that derive properties of items in a collection of independent media can be used to automatically embed stories into such collections. To do so, we use models that extract the tempo of songs to make a music playlist follow a narrative arc. Our work specifies an open-source tool that uses pre-trained neural network models to extract the global tempo of a s…

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 2 pages in main text + 1 page of references + 6 pages of appendices, 2 figures in main text + 3 figures in appendices, 1 algorithm in appendices; source code available at https://gist.github.com/dylanashley/1387a99deb85bfc0bce11286810cd98b

    ACM Class: H.5.5; I.2.6; J.5

  14. arXiv:2110.00116  [pdf]

    cs.SI cs.CL cs.LG

    #ContextMatters: Advantages and Limitations of Using Machine Learning to Support Women in Politics

    Authors: Jacqueline Comer, Sam Work, Kory W Mathewson, Lana Cuthbertson, Kasey Machin

    Abstract: The United Nations identified gender equality as a Sustainable Development Goal in 2015, recognizing the underrepresentation of women in politics as a specific barrier to achieving gender equality. Political systems around the world experience gender inequality across all levels of elected government as fewer women run for office than men. This is due in part to online abuse, particularly on socia…

    Submitted 10 October, 2021; v1 submitted 30 September, 2021; originally announced October 2021.

    Comments: 21 pages, 1 figure. Presented as Policy and Practice, Problem Pitches poster at EAAMO'21

  15. arXiv:2109.14728  [pdf, other]

    cs.HC cs.AI

    Collaborative Storytelling with Human Actors and AI Narrators

    Authors: Boyd Branch, Piotr Mirowski, Kory W. Mathewson

    Abstract: Large language models can be used for collaborative storytelling. In this work we report on using GPT-3 \cite{brown2020language} to co-narrate stories. The AI system must track plot progression and character arcs while the human actors perform scenes. This event report details how a novel conversational agent was employed as creative partner with a team of professional improvisers to explore long-…

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: 5 pages, 1 figure, accepted to ICCC as Short Paper: Event Report

  16. arXiv:2106.03982  [pdf, other]

    cs.CL

    Expressivity of Emergent Language is a Trade-off between Contextual Complexity and Unpredictability

    Authors: Shangmin Guo, Yi Ren, Kory Mathewson, Simon Kirby, Stefano V. Albrecht, Kenny Smith

    Abstract: Researchers are using deep learning models to explore the emergence of language in various language games, where agents interact and develop an emergent language to solve tasks. We focus on the factors that determine the expressivity of emergent languages, which reflects the amount of information about input spaces those languages are capable of encoding. We measure the expressivity of emergent la…

    Submitted 15 March, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: 22 pages, 12 figures, 5 tables

    Journal ref: International Conference on Learning Representations 2022

  17. arXiv:2102.03406  [pdf, other]

    cs.AI cs.LG

    Symbolic Behaviour in Artificial Intelligence

    Authors: Adam Santoro, Andrew Lampinen, Kory Mathewson, Timothy Lillicrap, David Raposo

    Abstract: The ability to use symbols is the pinnacle of human intelligence, but has yet to be fully replicated in machines. Here we argue that the path towards symbolically fluent artificial intelligence (AI) begins with a reinterpretation of what symbols are, how they come to exist, and how a system behaves when it uses them. We begin by offering an interpretation of symbols as entities whose meaning is es…

    Submitted 21 January, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

  18. arXiv:2012.06161  [pdf, other]

    cs.AI cs.HC

    Conceptualization and Framework of Hybrid Intelligence Systems

    Authors: Nikhil Prakash, Kory W. Mathewson

    Abstract: As artificial intelligence (AI) systems are getting ubiquitous within our society, issues related to its fairness, accountability, and transparency are increasing rapidly. As a result, researchers are integrating humans with AI systems to build robust and reliable hybrid intelligence systems. However, a proper conceptualization of these systems does not underpin this rapid growth. This article pro…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 8 pages, 1 figure, HAMLETS (Human And Machine in-the-Loop Evaluation and Learning Strategies) workshop at Thirty-fourth Conference on Neural Information Processing Systems

  19. arXiv:2012.05672  [pdf, other]

    cs.LG cs.AI cs.MA

    Imitating Interactive Intelligence

    Authors: Josh Abramson, Arun Ahuja, Iain Barr, Arthur Brussee, Federico Carnevale, Mary Cassin, Rachita Chhaparia, Stephen Clark, Bogdan Damoc, Andrew Dudzik, Petko Georgiev, Aurelia Guy, Tim Harley, Felix Hill, Alden Hung, Zachary Kenton, Jessica Landon, Timothy Lillicrap, Kory Mathewson, Soňa Mokrá, Alistair Muldal, Adam Santoro, Nikolay Savinov, Vikrant Varma, Greg Wayne , et al. (4 additional authors not shown)

    Abstract: A common vision from science fiction is that robots will one day inhabit our physical spaces, sense the world as we do, assist our physical labours, and communicate with us through natural language. Here we study how to design artificial agents that can interact naturally with humans using the simplification of a virtual environment. This setting nevertheless integrates a number of the central cha…

    Submitted 20 January, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

  20. arXiv:2012.02875  [pdf, other]

    cs.CL

    Inductive Bias and Language Expressivity in Emergent Communication

    Authors: Shangmin Guo, Yi Ren, Agnieszka Słowik, Kory Mathewson

    Abstract: Referential games and reconstruction games are the most common game types for studying emergent languages. We investigate how the type of the language game affects the emergent language in terms of: i) language compositionality and ii) transfer of an emergent language to a task different from its origin, which we refer to as language expressivity. With empirical experiments on a handcrafted symbol…

    Submitted 4 December, 2020; originally announced December 2020.

  21. arXiv:1911.11025  [pdf, other]

    cs.SI cs.CL cs.LG

    Women, politics and Twitter: Using machine learning to change the discourse

    Authors: Lana Cuthbertson, Alex Kearney, Riley Dawson, Ashia Zawaduk, Eve Cuthbertson, Ann Gordon-Tighe, Kory W Mathewson

    Abstract: Including diverse voices in political decision-making strengthens our democratic institutions. Within the Canadian political system, there is gender inequality across all levels of elected government. Online abuse, such as hateful tweets, leveled at women engaged in politics contributes to this inequity, particularly tweets focusing on their gender. In this paper, we present ParityBOT: a Twitter b…

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: 8 pages, 2 figures. Presented at the NeurIPS Joint Workshop on AI for Social Good at NeurIPS 2019

  22. arXiv:1905.06289  [pdf, ps, other]

    cs.HC cs.CY cs.LG

    A Human-Centered Approach to Interactive Machine Learning

    Authors: Kory W. Mathewson

    Abstract: The interactive machine learning (IML) community aims to augment humans' ability to learn and make decisions over time through the development of automated decision-making systems. This interaction represents a collaboration between multiple intelligent systems---humans and machines. A lack of appropriate consideration for the humans involved can lead to problematic system behaviour, and issues of…

    Submitted 15 May, 2019; originally announced May 2019.

    Comments: 4 pages, 4th Multidisciplinary Conference on Reinforcement Learning and Decision Making

  23. arXiv:1904.03371  [pdf, other]

    cs.CL cs.LG

    Evaluating Coherence in Dialogue Systems using Entailment

    Authors: Nouha Dziri, Ehsan Kamalloo, Kory W. Mathewson, Osmar Zaiane

    Abstract: Evaluating open-domain dialogue systems is difficult due to the diversity of possible correct answers. Automatic metrics such as BLEU correlate weakly with human annotations, resulting in a significant bias across different models and datasets. Some researchers resort to human judgment experimentation for assessing response quality, which is expensive, time consuming, and not scalable. Moreover, j…

    Submitted 31 March, 2020; v1 submitted 6 April, 2019; originally announced April 2019.

    Comments: 5 pages, 2 figures; NAACL-HLT 2019

  24. Automatically Generating Engaging Presentation Slide Decks

    Authors: Thomas Winters, Kory W. Mathewson

    Abstract: Talented public speakers have thousands of hours of practice. One means of improving public speaking skills is practice through improvisation, e.g. presenting an improvised presentation using an unseen slide deck. We present TEDRIC, a novel system capable of generating coherent slide decks based on a single topic suggestion. It combines semantic word webs with text and image data sources to create…

    Submitted 2 March, 2019; originally announced March 2019.

    Comments: To appear at EvoMusArt 2019

    MSC Class: 97R40

  25. arXiv:1901.11528  [pdf, other]

    cs.HC cs.AI cs.CL cs.LG

    Shaping the Narrative Arc: An Information-Theoretic Approach to Collaborative Dialogue

    Authors: Kory W. Mathewson, Pablo Samuel Castro, Colin Cherry, George Foster, Marc G. Bellemare

    Abstract: We consider the problem of designing an artificial agent capable of interacting with humans in collaborative dialogue to produce creative, engaging narratives. In this task, the goal is to establish universe details, and to collaborate on an interesting story in that universe, through a series of natural dialogue exchanges. Our model can augment any probabilistic conversational agent by allowing i…

    Submitted 31 January, 2019; originally announced January 2019.

    Comments: 20 pages, 9 figures

  26. arXiv:1811.03423  [pdf, ps, other]

    cs.CY cs.CL cs.HC cs.LG

    dAIrector: Automatic Story Beat Generation through Knowledge Synthesis

    Authors: Markus Eger, Kory W. Mathewson

    Abstract: dAIrector is an automated director which collaborates with human storytellers for live improvisational performances and writing assistance. dAIrector can be used to create short narrative arcs through contextual plot generation. In this work, we present the system architecture, a quantitative evaluation of design choices, and a case-study usage of the system which provides qualitative feedback fr…

    Submitted 31 October, 2018; originally announced November 2018.

    Comments: 10 pages with references, 1 figure. Accepted at Joint Workshop on Intelligent Narrative Technologies and Intelligent Cinematography and Editing at AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE'18). Edmonton, Alberta, Canada

  27. arXiv:1811.01063  [pdf, other]

    cs.CL

    Augmenting Neural Response Generation with Context-Aware Topical Attention

    Authors: Nouha Dziri, Ehsan Kamalloo, Kory W. Mathewson, Osmar Zaiane

    Abstract: Sequence-to-Sequence (Seq2Seq) models have witnessed a notable success in generating natural conversational exchanges. Notwithstanding the syntactically well-formed responses generated by these neural network models, they are prone to be acontextual, short and generic. In this work, we introduce a Topical Hierarchical Recurrent Encoder Decoder (THRED), a novel, fully data-driven, multi-turn respon…

    Submitted 4 June, 2019; v1 submitted 2 November, 2018; originally announced November 2018.

    Comments: Accepted at ACL 2019 Workshop on NLP for ConvAI (NLP4ConvAI). 8 pages + 4 appendix pages, 6 figures, 9 tables

  28. arXiv:1809.01807  [pdf, other]

    cs.AI cs.HC

    Improbotics: Exploring the Imitation Game using Machine Intelligence in Improvised Theatre

    Authors: Kory W. Mathewson, Piotr Mirowski

    Abstract: Theatrical improvisation (impro or improv) is a demanding form of live, collaborative performance. Improv is a humorous and playful artform built on an open-ended narrative structure which simultaneously celebrates effort and failure. It is thus an ideal test bed for the development and deployment of interactive artificial intelligence (AI)-based conversational agents, or artificial improvisors. T…

    Submitted 5 September, 2018; originally announced September 2018.

    Comments: 8 pages, 6 figures, AAAI Publications, 2018 Artificial Intelligence and Interactive Digital Entertainment Conference (AIIDE)

  29. Reactive Reinforcement Learning in Asynchronous Environments

    Authors: Jaden B. Travnik, Kory W. Mathewson, Richard S. Sutton, Patrick M. Pilarski

    Abstract: The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision Processes (SMDP), do not capture the fact that, in an asynchronous environment, the state of the environment may change during computation perfor…

    Submitted 16 February, 2018; originally announced February 2018.

    Comments: 11 pages, 7 figures, currently under journal peer review

  30. arXiv:1711.08819  [pdf, other]

    cs.AI

    Improvised Comedy as a Turing Test

    Authors: Kory Wallace Mathewson, Piotr Mirowski

    Abstract: The best improvisational theatre actors can make any scene partner, of any skill level or ability, appear talented and proficient in the art form, and thus "make them shine". To challenge this improvisational paradigm, we built an artificial intelligence (AI) trained to perform live shows alongside human actors for human audiences. Over the course of 30 performances to a combined audience of almos…

    Submitted 1 December, 2017; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: 4 pages, 3 figures. Presented at 31st Conference on Neural Information Processing Systems 2017. Workshop on Machine Learning for Creativity and Design

  31. arXiv:1711.03676  [pdf, other]

    cs.AI cs.HC cs.LG

    Communicative Capital for Prosthetic Agents

    Authors: Patrick M. Pilarski, Richard S. Sutton, Kory W. Mathewson, Craig Sherstan, Adam S. R. Parker, Ann L. Edwards

    Abstract: This work presents an overarching perspective on the role that machine intelligence can play in enhancing human abilities, especially those that have been diminished due to injury or illness. As a primary contribution, we develop the hypothesis that assistive devices, and specifically artificial arms and hands, can and should be viewed as agents in order for us to most effectively improve their co…

    Submitted 9 November, 2017; originally announced November 2017.

    Comments: 33 pages, 10 figures; unpublished technical report undergoing peer review

  32. arXiv:1703.01274  [pdf, other]

    cs.AI cs.HC cs.RO

    Actor-Critic Reinforcement Learning with Simultaneous Human Control and Feedback

    Authors: Kory W. Mathewson, Patrick M. Pilarski

    Abstract: This paper contributes a first study into how different human users deliver simultaneous control and feedback signals during human-robot interaction. As part of this work, we formalize and present a general interactive learning framework for online cooperation between humans and reinforcement learning agents. In many human-machine interaction settings, there is a growing gap between the degrees-of…

    Submitted 15 March, 2017; v1 submitted 3 March, 2017; originally announced March 2017.

    Comments: 10 pages, 2 pages of references, 8 figures. Under review for the 34th International Conference on Machine Learning, Sydney, Australia, 2017. Copyright 2017 by the authors

  33. arXiv:1701.02369  [pdf, other]

    cs.HC cs.AI cs.RO

    Reinforcement Learning based Embodied Agents Modelling Human Users Through Interaction and Multi-Sensory Perception

    Authors: Kory W. Mathewson, Patrick M. Pilarski

    Abstract: This paper extends recent work in interactive machine learning (IML) focused on effectively incorporating human feedback. We show how control and feedback signals complement each other in systems which model human reward. We demonstrate that simultaneously incorporating human control and feedback signals can improve interactive robotic systems' performance on a self-mirrored movement control task…

    Submitted 26 January, 2017; v1 submitted 9 January, 2017; originally announced January 2017.

    Comments: 4 pages, 2 figures, Accepted at the 2017 AAAI Spring Symposium on Interactive Multi-Sensory Object Perception for Embodied Agents

  34. arXiv:1606.06979  [pdf]

    cs.HC cs.AI cs.RO

    Simultaneous Control and Human Feedback in the Training of a Robotic Agent with Actor-Critic Reinforcement Learning

    Authors: Kory W. Mathewson, Patrick M. Pilarski

    Abstract: This paper contributes a preliminary report on the advantages and disadvantages of incorporating simultaneous human control and feedback signals in the training of a reinforcement learning robotic agent. While robotic human-machine interfaces have become increasingly complex in both form and function, control remains challenging for users. This has resulted in an increasing gap between user contro…

    Submitted 22 June, 2016; originally announced June 2016.

    Comments: 7 pages, 3 figures, Accepted at the Interactive Machine Learning Workshop at IJCAI 2016 (IML): Connecting Humans and Machines