Showing 1–10 of 10 results for author: Dsouza, A

  1. arXiv:2407.03651  [pdf, other]

    cs.CL cs.AI

    Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction

    Authors: Amanda Dsouza, Christopher Glaze, Changho Shin, Frederic Sala

    Abstract: Large language models are prominently used in real-world applications, often tasked with reasoning over large volumes of documents. An exciting development in this space is models boasting extended context capabilities, with some accommodating over 2 million tokens. Such long context model capabilities remain uncertain in production systems, motivating the need to benchmark their performance on re…

    Submitted 14 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2302.08823  [pdf, other]

    cs.AI cs.DB cs.IR

    Creating Knowledge Graphs for Geographic Data on the Web

    Authors: Elena Demidova, Alishiba Dsouza, Simon Gottschalk, Nicolas Tempelmeier, Ran Yu

    Abstract: Geographic data plays an essential role in various Web, Semantic Web and machine learning applications. OpenStreetMap and knowledge graphs are critical complementary sources of geographic data on the Web. However, data veracity, the lack of integration of geographic and semantic characteristics, and incomplete representations substantially limit the data utility. Verification, enrichment and seman…

    Submitted 17 February, 2023; originally announced February 2023.

    Journal ref: SIGWEB Newsl., Winter, Article 4 (Winter 2022), 8 pages

  3. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  4. arXiv:2205.03029  [pdf, other]

    cs.IT eess.SP

    Investigation of large-scale extended Granger causality (lsXGC) on synthetic functional MRI data

    Authors: Axel Wismüller, Ali Vosoughi, Adora DSouza, Anas Abidin

    Abstract: It is a challenging research endeavor to infer causal relationships in multivariate observational time-series. Such data may be represented by graphs, where nodes represent time-series, and edges directed causal influence scores between them. If the number of nodes exceeds the number of temporal observations, conventional methods, such as standard Granger causality, are of limited value, because e…

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: 10 pages, conference, 2 figures, 1 table. arXiv admin note: substantial text overlap with arXiv:2101.09354

    MSC Class: 94-XX

  5. WorldKG: A World-Scale Geographic Knowledge Graph

    Authors: Alishiba Dsouza, Nicolas Tempelmeier, Ran Yu, Simon Gottschalk, Elena Demidova

    Abstract: OpenStreetMap is a rich source of openly available geographic information. However, the representation of geographic entities, e.g., buildings, mountains, and cities, within OpenStreetMap is highly heterogeneous, diverse, and incomplete. As a result, this rich data source is hardly usable for real-world applications. This paper presents WorldKG -- a new geographic knowledge graph aiming to provide…

    Submitted 21 September, 2021; originally announced September 2021.

    ACM Class: H.0

    Journal ref: 30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

  6. arXiv:2107.13257  [pdf, other]

    cs.LG cs.AI

    Towards Neural Schema Alignment for OpenStreetMap and Knowledge Graphs

    Authors: Alishiba Dsouza, Nicolas Tempelmeier, Elena Demidova

    Abstract: OpenStreetMap (OSM) is one of the richest openly available sources of volunteered geographic information. Although OSM includes various geographical entities, their descriptions are highly heterogeneous, incomplete, and do not follow any well-defined ontology. Knowledge graphs can potentially provide valuable semantic information to enrich OSM entities. However, interlinking OSM entities with know…

    Submitted 28 July, 2021; originally announced July 2021.

  7. arXiv:2009.04681  [pdf, other]

    cs.LG cs.IT stat.ML

    Large-scale nonlinear Granger causality: A data-driven, multivariate approach to recovering directed networks from short time-series data

    Authors: Axel Wismüller, Adora M. DSouza, Anas Z. Abidin

    Abstract: To gain insight into complex systems it is a key challenge to infer nonlinear causal directional relations from observational time-series data. Specifically, estimating causal relationships between interacting components in large systems with only short recordings over few temporal observations remains an important, yet unresolved problem. Here, we introduce a large-scale Nonlinear Granger Causali…

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: 24 pages, 8 figures

    ACM Class: I.5.1

  8. arXiv:2006.06805  [pdf]

    eess.IV cs.CV cs.LG

    Automated Identification of Thoracic Pathology from Chest Radiographs with Enhanced Training Pipeline

    Authors: Adora M. DSouza, Anas Z. Abidin, Axel Wismüller

    Abstract: Chest x-rays are the most common radiology studies for diagnosing lung and heart disease. Hence, a system for automated pre-reporting of pathologic findings on chest x-rays would greatly enhance radiologists' productivity. To this end, we investigate a deep-learning framework with novel training schemes for classification of different thoracic pathology labels from chest x-rays. We use the current…

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 6 pages, 1 figure, 2 tables

    ACM Class: I.5.4; I.5.2; I.2.0

    Journal ref: Proc. SPIE 10950, Medical Imaging 2019: Computer-Aided Diagnosis, vol. 10950, p. 109503F, (2019)

  9. arXiv:1802.02427  [pdf, other]

    eess.IV cs.CV

    MRI Tumor Segmentation with Densely Connected 3D CNN

    Authors: Lele Chen, Yue Wu, Adora M. DSouza, Anas Z. Abidin, Axel Wismuller, Chenliang Xu

    Abstract: Glioma is one of the most common and aggressive types of primary brain tumors. The accurate segmentation of subcortical brain structures is crucial to the study of gliomas in that it helps the monitoring of the progression of gliomas and aids the evaluation of treatment outcomes. However, the large amount of required human labor makes it difficult to obtain the manually segmented Magnetic Resonanc…

    Submitted 9 February, 2018; v1 submitted 18 January, 2018; originally announced February 2018.

  10. arXiv:1407.3809  [pdf]

    cs.NE q-bio.NC

    A Framework for Exploring Non-Linear Functional Connectivity and Causality in the Human Brain: Mutual Connectivity Analysis (MCA) of Resting-State Functional MRI with Convergent Cross-Mapping and Non-Metric Clustering

    Authors: Axel Wismüller, Xixi Wang, Adora M. DSouza, Mahesh B. Nagarajan

    Abstract: We present a computational framework for analysis and visualization of non-linear functional connectivity in the human brain from resting state functional MRI (fMRI) data for purposes of recovering the underlying network community structure and exploring causality between network components. Our proposed methodology of non-linear mutual connectivity analysis (MCA) involves two computational steps.…

    Submitted 14 July, 2014; originally announced July 2014.

    Comments: Axel Wismüller and Mahesh B. Nagarajan contributed equally to the preparation of this manuscript. Pre-publication draft: 18 pages, 6 figures, 1 table