Skip to main content

Showing 1–25 of 25 results for author: Efrat, A

  1. arXiv:2309.07055  [pdf, other

    cs.CY

    Unraveling the Geography of Infection Spread: Harnessing Super-Agents for Predictive Modeling

    Authors: Amir Mohammad Esmaieeli Sikaroudi, Alon Efrat, Michael Chertkov

    Abstract: Our study presents an intermediate-level modeling approach that bridges the gap between complex Agent-Based Models (ABMs) and traditional compartmental models for infectious diseases. We introduce "super-agents" to simulate infection spread in cities, reducing computational complexity while retaining individual-level interactions. This approach leverages real-world mobility data and strategic geos… ▽ More

    Submitted 9 March, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

  2. arXiv:2305.14196  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding

    Authors: Uri Shaham, Maor Ivgi, Avia Efrat, Jonathan Berant, Omer Levy

    Abstract: We introduce ZeroSCROLLS, a zero-shot benchmark for natural language understanding over long texts, which contains only test and small validation sets, without training data. We adapt six tasks from the SCROLLS benchmark, and add four new datasets, including two novel information fusing tasks, such as aggregating the percentage of positive reviews. Using ZeroSCROLLS, we conduct a comprehensive eva… ▽ More

    Submitted 17 December, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

  3. arXiv:2305.11206  [pdf, other

    cs.CL cs.AI cs.LG

    LIMA: Less Is More for Alignment

    Authors: Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy

    Abstract: Large language models are trained in two stages: (1) unsupervised pretraining from raw text, to learn general-purpose representations, and (2) large scale instruction tuning and reinforcement learning, to better align to end tasks and user preferences. We measure the relative importance of these two stages by training LIMA, a 65B parameter LLaMa language model fine-tuned with the standard supervis… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  4. arXiv:2211.02069  [pdf, other

    cs.CL cs.AI cs.LG

    LMentry: A Language Model Benchmark of Elementary Language Tasks

    Authors: Avia Efrat, Or Honovich, Omer Levy

    Abstract: As the performance of large language models rapidly improves, benchmarks are getting larger and more complex as well. We present LMentry, a benchmark that avoids this "arms race" by focusing on a compact set of tasks that are trivial to humans, e.g. writing a sentence containing a specific word, identifying which words in a list belong to a specific category, or choosing which of two words is long… ▽ More

    Submitted 19 December, 2022; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: minor results updates

  5. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  6. arXiv:2201.03533  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    SCROLLS: Standardized CompaRison Over Long Language Sequences

    Authors: Uri Shaham, Elad Segal, Maor Ivgi, Avia Efrat, Ori Yoran, Adi Haviv, Ankit Gupta, Wenhan Xiong, Mor Geva, Jonathan Berant, Omer Levy

    Abstract: NLP benchmarks have largely focused on short texts, such as sentences and paragraphs, even though long texts comprise a considerable amount of natural language in the wild. We introduce SCROLLS, a suite of tasks that require reasoning over long texts. We examine existing long-text datasets, and handpick ones where the text is naturally long, while prioritizing tasks that involve synthesizing infor… ▽ More

    Submitted 11 October, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: EMNLP 2022

  7. arXiv:2109.04517  [pdf, other

    cs.SI physics.soc-ph

    Prediction and Prevention of Pandemics via Graphical Model Inference and Convex Programming

    Authors: Mikhail Krechetov, Amir Mohammad Esmaieeli Sikaroudi, Alon Efrat, Valentin Polishchuk, Michael Chertkov

    Abstract: Hard-to-predict bursts of COVID-19 pandemic revealed significance of statistical modeling which would resolve spatio-temporal correlations over geographical areas, for example spread of the infection over a city with census tract granularity. In this manuscript, we provide algorithmic answers to the following two inter-related public health challenges. (1) Inference Challenge: assuming that there… ▽ More

    Submitted 26 April, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

  8. arXiv:2108.05857  [pdf, other

    cs.CL

    How Optimal is Greedy Decoding for Extractive Question Answering?

    Authors: Or Castel, Ori Ram, Avia Efrat, Omer Levy

    Abstract: Fine-tuned language models use greedy decoding to answer reading comprehension questions with relative success. However, this approach does not ensure that the answer is a span in the given passage, nor does it guarantee that it is the most probable one. Does greedy decoding actually perform worse than an algorithm that does adhere to these properties? To study the performance and optimality of gr… ▽ More

    Submitted 8 November, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: AKBC 2022 12 pages, 3 figures

  9. arXiv:2103.01242  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language

    Authors: Avia Efrat, Uri Shaham, Dan Kilman, Omer Levy

    Abstract: Current NLP datasets targeting ambiguity can be solved by a native speaker with relative ease. We present Cryptonite, a large-scale dataset based on cryptic crosswords, which is both linguistically complex and naturally sourced. Each example in Cryptonite is a cryptic clue, a short phrase or sentence with a misleading surface reading, whose solving requires disambiguating semantic, syntactic, and… ▽ More

    Submitted 1 November, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: EMNLP 2021

  10. arXiv:2010.11982  [pdf, other

    cs.CL cs.AI cs.LG

    The Turking Test: Can Language Models Understand Instructions?

    Authors: Avia Efrat, Omer Levy

    Abstract: Supervised machine learning provides the learner with a set of input-output examples of the target task. Humans, however, can also learn to perform new tasks from instructions in natural language. Can machines learn to understand instructions as well? We present the Turking Test, which examines a model's ability to follow natural language instructions of varying complexity. These range from simple… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  11. arXiv:2008.10192  [pdf, other

    cs.CG cs.DM math.CO

    Polygons with Prescribed Angles in 2D and 3D

    Authors: Alon Efrat, Radoslav Fulek, Stephen Kobourov, Csaba D. Tóth

    Abstract: We consider the construction of a polygon $P$ with $n$ vertices whose turning angles at the vertices are given by a sequence $A=(α_0,\ldots, α_{n-1})$, $α_i\in (-π,π)$, for $i\in\{0,\ldots, n-1\}$. The problem of realizing $A$ by a polygon can be seen as that of constructing a straight-line drawing of a graph with prescribed angles at vertices, and hence, it is a special case of the well studied p… ▽ More

    Submitted 1 November, 2020; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: 15 pages, 9 figures, a new section about self-intersecting realizations in 3D

  12. arXiv:2001.08773  [pdf, other

    cs.CR cs.CG cs.DB

    Data Inference from Encrypted Databases: A Multi-dimensional Order-Preserving Matching Approach

    Authors: Yanjun Pan, Alon Efrat, Ming Li, Boyang Wang, Hanyu Quan, Joseph Mitchell, Jie Gao, Esther Arkin

    Abstract: Due to increasing concerns of data privacy, databases are being encrypted before they are stored on an untrusted server. To enable search operations on the encrypted data, searchable encryption techniques have been proposed. Representative schemes use order-preserving encryption (OPE) for supporting efficient Boolean queries on encrypted databases. Yet, recent works showed the possibility of infer… ▽ More

    Submitted 23 January, 2020; originally announced January 2020.

    Comments: 11 pages, 4 figures

  13. arXiv:1909.13375  [pdf, other

    cs.CL

    A Simple and Effective Model for Answering Multi-span Questions

    Authors: Elad Segal, Avia Efrat, Mor Shoham, Amir Globerson, Jonathan Berant

    Abstract: Models for reading comprehension (RC) commonly restrict their output space to the set of all single contiguous spans from the input, in order to alleviate the learning problem and avoid the need for a model that generates text explicitly. However, forcing an answer to be a single span can be restrictive, and some recent datasets also include multi-span questions, i.e., questions whose answer is a… ▽ More

    Submitted 5 October, 2020; v1 submitted 29 September, 2019; originally announced September 2019.

    Comments: EMNLP 2020

  14. arXiv:1902.06875  [pdf, other

    cs.CG

    Euclidean TSP, Motorcycle Graphs, and Other New Applications of Nearest-Neighbor Chains

    Authors: Nil Mamano, Alon Efrat, David Eppstein, Daniel Frishberg, Michael Goodrich, Stephen Kobourov, Pedro Matias, Valentin Polishchuk

    Abstract: We show new applications of the nearest-neighbor chain algorithm, a technique that originated in agglomerative hierarchical clustering. We apply it to a diverse class of geometric problems: we construct the greedy multi-fragment tour for Euclidean TSP in $O(n\log n)$ time in any fixed dimension and for Steiner TSP in planar graphs in $O(n\sqrt{n}\log n)$ time; we compute motorcycle graphs (which a… ▽ More

    Submitted 2 December, 2019; v1 submitted 18 February, 2019; originally announced February 2019.

    Comments: 35 pages, 10 figures; v2: minor improvements, added Figure 1, and author order as in paper. v3: added funding acknowledgement

    MSC Class: I.3.5 ACM Class: I.3.5

  15. arXiv:1811.11700  [pdf, other

    cs.DS

    Approximation algorithms for the vertex-weighted grade-of-service Steiner tree problem

    Authors: Faryad Darabi Sahneh, Alon Efrat, Stephen Kobourov, Spencer Krieger, Richard Spence

    Abstract: Given a graph $G = (V,E)$ and a subset $T \subseteq V$ of terminals, a \emph{Steiner tree} of $G$ is a tree that spans $T$. In the vertex-weighted Steiner tree (VST) problem, each vertex is assigned a non-negative weight, and the goal is to compute a minimum weight Steiner tree of $G$. We study a natural generalization of the VST problem motivated by multi-level graph construction, the \emph{ver… ▽ More

    Submitted 3 May, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

  16. arXiv:1804.02627  [pdf, other

    cs.DS

    Multi-Level Steiner Trees

    Authors: Reyan Ahmed, Patrizio Angelini, Faryad Darabi Sahneh, Alon Efrat, David Glickenstein, Martin Gronemann, Niklas Heinsohn, Stephen G. Kobourov, Richard Spence, Joseph Watkins, Alexander Wolff

    Abstract: In the classical Steiner tree problem, given an undirected, connected graph $G=(V,E)$ with non-negative edge costs and a set of \emph{terminals} $T\subseteq V$, the objective is to find a minimum-cost tree $E' \subseteq E$ that spans the terminals. The problem is APX-hard; the best known approximation algorithm has a ratio of $ρ= \ln(4)+\varepsilon < 1.39$. In this paper, we study a natural genera… ▽ More

    Submitted 26 November, 2018; v1 submitted 8 April, 2018; originally announced April 2018.

    Comments: This paper has been accepted in 17th International Symposium on Experimental Algorithms (SEA 2018)

  17. arXiv:1703.01544  [pdf, other

    cs.CG

    L-Graphs and Monotone L-Graphs

    Authors: Abu Reyan Ahmed, Felice De Luca, Sabin Devkota, Alon Efrat, Md Iqbal Hossain, Stephen Kobourov, Jixian Li, Sammi Abida Salma, Eric Welch

    Abstract: In an $\mathsf{L}$-embedding of a graph, each vertex is represented by an $\mathsf{L}$-segment, and two segments intersect each other if and only if the corresponding vertices are adjacent in the graph. If the corner of each $\mathsf{L}$-segment in an $\mathsf{L}$-embedding lies on a straight line, we call it a monotone $\mathsf{L}$-embedding. In this paper we give a full characterization of monot… ▽ More

    Submitted 4 March, 2017; originally announced March 2017.

  18. arXiv:1511.02525  [pdf, other

    cs.DS

    Improved Approximation Algorithms for Relay Placement

    Authors: Alon Efrat, Sándor P. Fekete, Joseph S. B. Mitchell, Valentin Polishchuk, Jukka Suomela

    Abstract: In the relay placement problem the input is a set of sensors and a number $r \ge 1$, the communication range of a relay. In the one-tier version of the problem the objective is to place a minimum number of relays so that between every pair of sensors there is a path through sensors and/or relays such that the consecutive vertices of the path are within distance $r$ if both vertices are relays and… ▽ More

    Submitted 8 November, 2015; originally announced November 2015.

    Comments: 1+29 pages, 12 figures

  19. arXiv:1409.0296  [pdf, other

    cs.CY

    A Mobile Food Recommendation System Based on The Traffic Light Diet

    Authors: Thienne Johnson, Jorge Vergara, Chelsea Doll, Madison Kramer, Gayathri Sundararaman, Harsha Rajendran, Alon Efrat, Melanie Hingle

    Abstract: Innovative, real-time solutions are needed to address the mismatch between the demand for and supply of critical information to inform and motivate diet and health-related behavior change. Research suggests that interventions using mobile health technologies hold great promise for influencing knowledge, attitudes, and behaviors related to energy balance. The objective of this paper is to present i… ▽ More

    Submitted 1 September, 2014; originally announced September 2014.

  20. arXiv:0912.4115  [pdf, other

    cs.NI

    On Channel-Discontinuity-Constraint Routing in Wireless Networks

    Authors: Swaminathan Sankararaman, Alon Efrat, Srinivasan Ramasubramanian, Pankaj K. Agarwal

    Abstract: Multi-channel wireless networks are increasingly being employed as infrastructure networks, e.g. in metro areas. Nodes in these networks frequently employ directional antennas to improve spatial throughput. In such networks, given a source and destination, it is of interest to compute an optimal path and channel assignment on every link in the path such that the path bandwidth is the same as tha… ▽ More

    Submitted 26 February, 2010; v1 submitted 21 December, 2009; originally announced December 2009.

  21. arXiv:0911.4332  [pdf, other

    cs.NI

    Scheduling Sensors for Guaranteed Sparse Coverage

    Authors: Swaminathan Sankararaman, Alon Efrat, Srinivasan Ramasubramanian, Javad Taheri

    Abstract: Sensor networks are particularly applicable to the tracking of objects in motion. For such applications, it may not necessary that the whole region be covered by sensors as long as the uncovered region is not too large. This notion has been formalized by Balasubramanian et.al. as the problem of $κ$-weak coverage. This model of coverage provides guarantees about the regions in which the objects m… ▽ More

    Submitted 26 February, 2010; v1 submitted 23 November, 2009; originally announced November 2009.

  22. arXiv:cs/0605102  [pdf, ps, other

    cs.DS cs.CG

    Restricted Strip Covering and the Sensor Cover Problem

    Authors: Adam L. Buchsbaum, Alon Efrat, Shaili Jain, Suresh Venkatasubramanian, Ke Yi

    Abstract: Given a set of objects with durations (jobs) that cover a base region, can we schedule the jobs to maximize the duration the original region remains covered? We call this problem the sensor cover problem. This problem arises in the context of covering a region with sensors. For example, suppose you wish to monitor activity along a fence by sensors placed at various fixed locations. Each sensor h… ▽ More

    Submitted 23 May, 2006; originally announced May 2006.

    Comments: 14 pages, 6 figures

  23. arXiv:cs/0206018  [pdf, ps, other

    cs.CG

    On Simultaneous Graph Embedding

    Authors: C. A. Duncan, A. Efrat, C. Erten, S. Kobourov, J. S. B. Mitchell

    Abstract: We consider the problem of simultaneous embedding of planar graphs. There are two variants of this problem, one in which the mapping between the vertices of the two graphs is given and another where the mapping is not given. In particular, we show that without mapping, any number of outerplanar graphs can be embedded simultaneously on an $O(n)\times O(n)$ grid, and an outerplanar and general pla… ▽ More

    Submitted 9 September, 2002; v1 submitted 11 June, 2002; originally announced June 2002.

    Comments: 12 pages, 4 figures

    ACM Class: F.2.2; G.2.2

  24. arXiv:cs/0204050  [pdf, ps, other

    cs.CG

    Computing Homotopic Shortest Paths Efficiently

    Authors: Alon Efrat, Stephen G. Kobourov, Anna Lubiw

    Abstract: This paper addresses the problem of finding shortest paths homotopic to a given disjoint set of paths that wind amongst point obstacles in the plane. We present a faster algorithm than previously known.

    Submitted 25 April, 2002; originally announced April 2002.

    Comments: 12 pages, 11 figures

    ACM Class: I.3.5; F.2.2

  25. arXiv:cs/0009013  [pdf, ps, other

    cs.CG

    Pattern Matching for sets of segments

    Authors: Alon Efrat, Piotr Indyk, Suresh Venkatasubramanian

    Abstract: In this paper we present algorithms for a number of problems in geometric pattern matching where the input consist of a collections of segments in the plane. Our work consists of two main parts. In the first, we address problems and measures that relate to collections of orthogonal line segments in the plane. Such collections arise naturally from problems in mapping buildings and robot explorati… ▽ More

    Submitted 22 September, 2000; v1 submitted 19 September, 2000; originally announced September 2000.

    Comments: To appear in the 12 ACM Symposium on Discrete Algorithms, Jan 2001

    ACM Class: F.2.2