Skip to main content

Showing 1–14 of 14 results for author: Segal, E

  1. arXiv:2403.09672  [pdf, other

    cs.CV cs.LG

    COMPRER: A Multimodal Multi-Objective Pretraining Framework for Enhanced Medical Image Representation

    Authors: Guy Lutsker, Hagai Rossman, Nastya Godiva, Eran Segal

    Abstract: Substantial advances in multi-modal Artificial Intelligence (AI) facilitate the combination of diverse medical modalities to achieve holistic health assessments. We present COMPRER , a novel multi-modal, multi-objective pretraining framework which enhances medical-image representation, diagnostic inferences, and prognosis of diseases. COMPRER employs a multi-objective training framework, where eac… ▽ More

    Submitted 4 February, 2024; originally announced March 2024.

  2. arXiv:2312.07160  [pdf, other

    cs.IR

    Audience Prospecting for Dynamic-Product-Ads in Native Advertising

    Authors: Eliran Abutbul, Yohay Kaplan, Naama Krasne, Oren Somekh, Or David, Omer Duvdevany, Evgeny Segal

    Abstract: With yearly revenue exceeding one billion USD, Yahoo Gemini native advertising marketplace serves more than two billion impressions daily to hundreds of millions of unique users. One of the fastest growing segments of Gemini native is dynamic-product-ads (DPA), where major advertisers, such as Amazon and Walmart, provide catalogs with millions of products for the system to choose from and present… ▽ More

    Submitted 13 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: In Proc. IeeeBigData'2023 (Industry and Government Program)

  3. arXiv:2311.08979  [pdf, other

    cs.LG eess.SP

    A Multimodal Dataset of 21,412 Recorded Nights for Sleep and Respiratory Research

    Authors: Alon Diament, Maria Gorodetski, Adam Jankelow, Ayya Keshet, Tal Shor, Daphna Weissglas-Volkov, Hagai Rossman, Eran Segal

    Abstract: This study introduces a novel, rich dataset obtained from home sleep apnea tests using the FDA-approved WatchPAT-300 device, collected from 7,077 participants over 21,412 nights. The dataset comprises three levels of sleep data: raw multi-channel time-series from sensors, annotated sleep events, and computed summary statistics, which include 447 features related to sleep architecture, sleep apnea,… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 14 pages

  4. arXiv:2306.04971  [pdf, other

    cs.NE cs.LG

    A Melting Pot of Evolution and Learning

    Authors: Moshe Sipper, Achiya Elyasaf, Tomer Halperin, Zvika Haramaty, Raz Lapid, Eyal Segal, Itai Tzruia, Snir Vitrack Tamam

    Abstract: We survey eight recent works by our group, involving the successful blending of evolutionary algorithms with machine learning and deep learning: 1. Binary and Multinomial Classification through Evolutionary Symbolic Regression, 2. Classy Ensemble: A Novel Ensemble Algorithm for Classification, 3. EC-KitY: Evolutionary Computation Tool Kit in Python, 4. Evolution of Activation Functions for Deep Le… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: To Appear in Proceedings of Genetic Programming Theory & Practice XX, 2023

  5. arXiv:2211.00262  [pdf, other

    cs.CL cs.CV

    Training Vision-Language Models with Less Bimodal Supervision

    Authors: Elad Segal, Ben Bogin, Jonathan Berant

    Abstract: Standard practice in pretraining multimodal models, such as vision-language models, is to rely on pairs of aligned inputs from both modalities, for example, aligned image-text pairs. However, such pairs can be difficult to obtain in low-resource settings and for some modality pairs (e.g., structured tables and images). In this work, we investigate the extent to which we can reduce the reliance on… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: AKBC 2022

  6. arXiv:2209.03618  [pdf, other

    cs.NE cs.MA

    Adaptive Combination of a Genetic Algorithm and Novelty Search for Deep Neuroevolution

    Authors: Eyal Segal, Moshe Sipper

    Abstract: Evolutionary Computation (EC) has been shown to be able to quickly train Deep Artificial Neural Networks (DNNs) to solve Reinforcement Learning (RL) problems. While a Genetic Algorithm (GA) is well-suited for exploiting reward functions that are neither deceptive nor sparse, it struggles when the reward function is either of those. To that end, Novelty Search (NS) has been shown to be able to outp… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Journal ref: Proceedings of the 14th International Joint Conference on Computational Intelligence (IJCCI 2022)

  7. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  8. arXiv:2201.03533  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    SCROLLS: Standardized CompaRison Over Long Language Sequences

    Authors: Uri Shaham, Elad Segal, Maor Ivgi, Avia Efrat, Ori Yoran, Adi Haviv, Ankit Gupta, Wenhan Xiong, Mor Geva, Jonathan Berant, Omer Levy

    Abstract: NLP benchmarks have largely focused on short texts, such as sentences and paragraphs, even though long texts comprise a considerable amount of natural language in the wild. We introduce SCROLLS, a suite of tasks that require reasoning over long texts. We examine existing long-text datasets, and handpick ones where the text is naturally long, while prioritizing tasks that involve synthesizing infor… ▽ More

    Submitted 11 October, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: EMNLP 2022

  9. arXiv:2101.02235  [pdf, other

    cs.CL

    Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies

    Authors: Mor Geva, Daniel Khashabi, Elad Segal, Tushar Khot, Dan Roth, Jonathan Berant

    Abstract: A key limitation in current datasets for multi-hop reasoning is that the required steps for answering the question are mentioned in it explicitly. In this work, we introduce StrategyQA, a question answering (QA) benchmark where the required reasoning steps are implicit in the question, and should be inferred using a strategy. A fundamental challenge in this setup is how to elicit such creative que… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2021. Author's final version

  10. arXiv:1909.13375  [pdf, other

    cs.CL

    A Simple and Effective Model for Answering Multi-span Questions

    Authors: Elad Segal, Avia Efrat, Mor Shoham, Amir Globerson, Jonathan Berant

    Abstract: Models for reading comprehension (RC) commonly restrict their output space to the set of all single contiguous spans from the input, in order to alleviate the learning problem and avoid the need for a model that generates text explicitly. However, forcing an answer to be a single span can be restrictive, and some recent datasets also include multi-span questions, i.e., questions whose answer is a… ▽ More

    Submitted 5 October, 2020; v1 submitted 29 September, 2019; originally announced September 2019.

    Comments: EMNLP 2020

  11. arXiv:1805.08691  [pdf, other

    cs.CV

    Highly Efficient 8-bit Low Precision Inference of Convolutional Neural Networks with IntelCaffe

    Authors: Jiong Gong, Haihao Shen, Guoming Zhang, Xiaoli Liu, Shane Li, Ge Jin, Niharika Maheshwari, Evarist Fomenko, Eden Segal

    Abstract: High throughput and low latency inference of deep neural networks are critical for the deployment of deep learning applications. This paper presents the efficient inference techniques of IntelCaffe, the first Intel optimized deep learning framework that supports efficient 8-bit low precision inference and model optimization techniques of convolutional neural networks on Intel Xeon Scalable Process… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: 1st Reproducible Tournament on Pareto-efficient Image Classification, co-held with ASPLOS 2018

  12. arXiv:1805.06440  [pdf, other

    stat.ML cs.LG

    Regularization Learning Networks: Deep Learning for Tabular Datasets

    Authors: Ira Shavitt, Eran Segal

    Abstract: Despite their impressive performance, Deep Neural Networks (DNNs) typically underperform Gradient Boosting Trees (GBTs) on many tabular-dataset learning tasks. We propose that applying a different regularization coefficient to each weight might boost the performance of DNNs by allowing them to make more use of the more relevant inputs. However, this will lead to an intractable number of hyperparam… ▽ More

    Submitted 23 October, 2018; v1 submitted 16 May, 2018; originally announced May 2018.

    Comments: Accepted to the 32nd Conference on Neural Information Processing Systems (NIPS 2018), Montreal, Canada

  13. arXiv:1301.2289  [pdf

    cs.AI

    Exact Inference in Networks with Discrete Children of Continuous Parents

    Authors: Uri Lerner, Eran Segal, Daphne Koller

    Abstract: Many real life domains contain a mixture of discrete and continuous variables and can be modeled as hybrid Bayesian Networks. Animportant subclass of hybrid BNs are conditional linear Gaussian (CLG) networks, where the conditional distribution of the continuous variables given an assignment to the discrete variables is a multivariate Gaussian. Lauritzen's extension to the clique tree algorithm can… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-319-328

  14. arXiv:1212.2517  [pdf

    cs.LG cs.CE stat.ML

    Learning Module Networks

    Authors: Eran Segal, Dana Pe'er, Aviv Regev, Daphne Koller, Nir Friedman

    Abstract: Methods for learning Bayesian network structure can discover dependency structure between observed variables, and have been shown to be useful in many applications. However, in domains that involve a large number of variables, the space of possible network structures is enormous, making it difficult, for both computational and statistical reasons, to identify a good model. In this… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-525-534