Showing 1–8 of 8 results for author: Safaya, A

  1. arXiv:2407.02486  [pdf, other]

    cs.CL cs.AI cs.LG

    Neurocache: Efficient Vector Retrieval for Long-range Language Modeling

    Authors: Ali Safaya, Deniz Yuret

    Abstract: This paper introduces Neurocache, an approach to extend the effective context size of large language models (LLMs) using an external vector cache to store its past states. Like recent vector retrieval approaches, Neurocache uses an efficient k-nearest-neighbor (kNN) algorithm to retrieve relevant past states and incorporate them into the attention process. Neurocache improves upon previous methods…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Long paper, published at the main conference NAACL'24

  2. arXiv:2210.07323  [pdf]

    cs.CL cs.LG cs.SD eess.AS

    Experiments on Turkish ASR with Self-Supervised Speech Representation Learning

    Authors: Ali Safaya, Engin Erzin

    Abstract: While the Turkish language is listed among low-resource languages, literature on Turkish automatic speech recognition (ASR) is relatively old. In this report, we present our findings on Turkish ASR with speech representation learning using HUBERT. We investigate pre-training HUBERT for Turkish with large-scale data curated from online resources. We pre-train our model using 6,500 hours of speech d…

    Submitted 23 December, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

  3. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  4. arXiv:2203.10123  [pdf, other]

    cs.CL

    Event Coreference Resolution for Contentious Politics Events

    Authors: Ali Hürriyetoğlu, Osman Mutlu, Fatih Beyhan, Fırat Duruşan, Ali Safaya, Reyyan Yeniterzi, Erdem Yörük

    Abstract: We propose a dataset for event coreference resolution, which is based on random samples drawn from multiple sources, languages, and countries. Early scholarship on event information collection has not quantified the contribution of event coreference resolution. We prepared and analyzed a representative multilingual corpus and measured the performance and contribution of the state-of-the-art event…

    Submitted 18 March, 2022; originally announced March 2022.

  5. arXiv:2203.01215  [pdf, other]

    cs.CL

    Mukayese: Turkish NLP Strikes Back

    Authors: Ali Safaya, Emirhan Kurtuluş, Arda Göktoğan, Deniz Yuret

    Abstract: Having sufficient resources for language X lifts it from the under-resourced languages class, but not necessarily from the under-researched class. In this paper, we address the problem of the absence of organized benchmarks in the Turkish language. We demonstrate that languages such as Turkish are left behind the state-of-the-art in NLP applications. As a solution, we present Mukayese, a set of NL…

    Submitted 16 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: Accepted at Findings of ACL 2022 (Camera Ready)

  6. arXiv:2009.03191  [pdf, ps, other]

    cs.CL

    COVCOR20 at WNUT-2020 Task 2: An Attempt to Combine Deep Learning and Expert rules

    Authors: Ali Hürriyetoğlu, Ali Safaya, Nelleke Oostdijk, Osman Mutlu, Erdem Yörük

    Abstract: In the scope of WNUT-2020 Task 2, we developed various text classification systems, using deep learning models and one using linguistically informed rules. While both of the deep learning systems outperformed the system using the linguistically informed rules, we found that through the integration of (the output of) the three systems a better performance could be achieved than the standalone perfo…

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: Shared task report

  7. arXiv:2007.13184  [pdf, other]

    cs.CL

    KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media

    Authors: Ali Safaya, Moutasem Abdullatif, Deniz Yuret

    Abstract: In this paper, we describe our approach to utilize pre-trained BERT models with Convolutional Neural Networks for sub-task A of the Multilingual Offensive Language Identification shared task (OffensEval 2020), which is a part of the SemEval 2020. We show that combining CNN with BERT is better than using BERT on its own, and we emphasize the importance of utilizing pre-trained language models for d…

    Submitted 26 July, 2020; originally announced July 2020.

    Comments: to be published in the proceedings of the 14th International Workshop on Semantic Evaluation (SemEval2020), Association for Computational Linguistics (ACL)

    ACM Class: I.2.7

  8. arXiv:2005.06070  [pdf, ps, other]

    cs.CL cs.CY cs.LG

    Automated Extraction of Socio-political Events from News (AESPEN): Workshop and Shared Task Report

    Authors: Ali Hürriyetoğlu, Vanni Zavarella, Hristo Tanev, Erdem Yörük, Ali Safaya, Osman Mutlu

    Abstract: We describe our effort on automated extraction of socio-political events from news in the scope of a workshop and a shared task we organized at Language Resources and Evaluation Conference (LREC 2020). We believe the event extraction studies in computational linguistics and social and political sciences should further support each other in order to enable large scale socio-political event informat…

    Submitted 12 May, 2020; originally announced May 2020.