Showing 1–36 of 36 results for author: Constant, N

  1. arXiv:2404.03626  [pdf, other]

    cs.CL cs.LG

    Training LLMs over Neurally Compressed Text

    Authors: Brian Lester, Jaehoon Lee, Alex Alemi, Jeffrey Pennington, Adam Roberts, Jascha Sohl-Dickstein, Noah Constant

    Abstract: In this paper, we explore the idea of training large language models (LLMs) over highly compressed text. While standard subword tokenizers compress text by a small factor, neural text compressors can achieve much higher rates of compression. If it were possible to train LLMs directly over neurally compressed text, this would confer advantages in training and serving efficiency, as well as easier h…

    Submitted 4 April, 2024; originally announced April 2024.

  2. arXiv:2401.17181  [pdf, other]

    cs.CL

    Transfer Learning for Text Diffusion Models

    Authors: Kehang Han, Kathleen Kenealy, Aditya Barua, Noah Fiedel, Noah Constant

    Abstract: In this report, we explore the potential for text diffusion to replace autoregressive (AR) decoding for the training and deployment of large language models (LLMs). We are particularly interested to see whether pretrained AR models can be transformed into text diffusion models through a lightweight adaptation procedure we call "AR2Diff". We begin by establishing a strong baseline setup for train…

    Submitted 30 January, 2024; originally announced January 2024.

  3. arXiv:2312.06585  [pdf, other]

    cs.LG

    Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

    Authors: Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, et al. (16 additional authors not shown)

    Abstract: Fine-tuning language models (LMs) on human-generated data remains a prevalent practice. However, the performance of such models is often limited by the quantity and diversity of high-quality human data. In this paper, we explore whether we can go beyond human data on tasks where we have access to scalar feedback, for example, on math problems where one can verify correctness. To do so, we investig…

    Submitted 17 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted to TMLR. Camera-ready version. First three authors contributed equally

  4. arXiv:2311.07587  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?"

    Authors: C. Daniel Freeman, Laura Culp, Aaron Parisi, Maxwell L Bileschi, Gamaleldin F Elsayed, Alex Rizkowsky, Isabelle Simpson, Alex Alemi, Azade Nova, Ben Adlam, Bernd Bohnet, Gaurav Mishra, Hanie Sedghi, Igor Mordatch, Izzeddin Gur, Jaehoon Lee, JD Co-Reyes, Jeffrey Pennington, Kelvin Xu, Kevin Swersky, Kshiteej Mahajan, Lechao Xiao, Rosanne Liu, Simon Kornblith, Noah Constant, et al. (5 additional authors not shown)

    Abstract: We introduce and study the problem of adversarial arithmetic, which provides a simple yet challenging testbed for language model alignment. This problem is comprised of arithmetic questions posed in natural language, with an arbitrary adversarial string inserted before the question is complete. Even in the simple setting of 1-digit addition problems, it is easy to find adversarial prompts that mak…

    Submitted 15 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

  5. arXiv:2310.03214  [pdf, other]

    cs.CL

    FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation

    Authors: Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc Le, Thang Luong

    Abstract: Most large language models (LLMs) are trained once and never updated; thus, they lack the ability to dynamically adapt to our ever-changing world. In this work, we perform a detailed study of the factuality of LLM-generated text in the context of answering questions that test current world knowledge. Specifically, we introduce FreshQA, a novel dynamic QA benchmark encompassing a diverse range of q…

    Submitted 22 November, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Preprint, 26 pages, 10 figures, 5 tables; Added FreshEval

  6. arXiv:2304.09151  [pdf, other]

    cs.CL

    UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining

    Authors: Hyung Won Chung, Noah Constant, Xavier Garcia, Adam Roberts, Yi Tay, Sharan Narang, Orhan Firat

    Abstract: Pretrained multilingual large language models have typically used heuristic temperature-based sampling to balance between different languages. However, previous work has not systematically evaluated the efficacy of different pretraining language distributions across model scales. In this paper, we propose a new sampling method, UniMax, that delivers more uniform coverage of head languages while mit…

    Submitted 18 April, 2023; originally announced April 2023.

  7. arXiv:2212.10562  [pdf, other]

    cs.CL cs.CV

    Character-Aware Models Improve Visual Text Rendering

    Authors: Rosanne Liu, Dan Garrette, Chitwan Saharia, William Chan, Adam Roberts, Sharan Narang, Irina Blok, RJ Mical, Mohammad Norouzi, Noah Constant

    Abstract: Current image generation models struggle to reliably produce well-formed visual text. In this paper, we investigate a key contributing factor: popular text-to-image models lack character-level input features, making it much harder to predict a word's visual makeup as a series of glyphs. To quantify this effect, we conduct a series of experiments comparing character-aware vs. character-blind text e…

    Submitted 3 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  8. arXiv:2210.00193  [pdf, other]

    cs.CL

    FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation

    Authors: Parker Riley, Timothy Dozat, Jan A. Botha, Xavier Garcia, Dan Garrette, Jason Riesa, Orhan Firat, Noah Constant

    Abstract: We present FRMT, a new dataset and evaluation benchmark for Few-shot Region-aware Machine Translation, a type of style-targeted translation. The dataset consists of professional translations from English into two regional variants each of Portuguese and Mandarin Chinese. Source documents are selected to enable detailed analysis of phenomena of interest, including lexically distinct terms and distr…

    Submitted 3 October, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: Published in TACL Vol. 11 (2023)

  9. arXiv:2209.14500  [pdf, other]

    cs.LG cs.CL

    Bidirectional Language Models Are Also Few-shot Learners

    Authors: Ajay Patel, Bryan Li, Mohammad Sadegh Rasooli, Noah Constant, Colin Raffel, Chris Callison-Burch

    Abstract: Large language models such as GPT-3 (Brown et al., 2020) can perform arbitrary tasks without undergoing fine-tuning after being prompted with only a few labeled examples. An arbitrary task can be reformulated as a natural language prompt, and a language model can be asked to generate the completion, indirectly performing the task in a paradigm known as prompt-based learning. To date, emergent prom…

    Submitted 5 February, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: To appear at ICLR 2023

  10. arXiv:2208.05577  [pdf, other]

    cs.CL

    Reducing Retraining by Recycling Parameter-Efficient Prompts

    Authors: Brian Lester, Joshua Yurtsever, Siamak Shakeri, Noah Constant

    Abstract: Parameter-efficient methods are able to use a single frozen pre-trained large language model (LLM) to perform many tasks by learning task-specific soft prompts that modulate model behavior when concatenated to the input text. However, these learned prompts are tightly coupled to a given frozen model -- if the model is updated, corresponding new prompts need to be obtained. In this work, we propose…

    Submitted 10 August, 2022; originally announced August 2022.

  11. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  12. arXiv:2205.12647  [pdf, other]

    cs.CL

    Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation

    Authors: Tu Vu, Aditya Barua, Brian Lester, Daniel Cer, Mohit Iyyer, Noah Constant

    Abstract: In this paper, we explore the challenging problem of performing a generative task in a target language when labeled data is only available in English, using summarization as a case study. We assume a strict setting with no access to parallel data or machine translation and find that common transfer learning approaches struggle in this setting, as a generative multilingual model fine-tuned purely o…

    Submitted 23 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted as a main conference paper at EMNLP 2022, 22 pages, 8 figures, 11 tables

  13. arXiv:2110.07904  [pdf, other]

    cs.CL

    SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer

    Authors: Tu Vu, Brian Lester, Noah Constant, Rami Al-Rfou, Daniel Cer

    Abstract: There has been growing interest in parameter-efficient methods to apply pre-trained language models to downstream tasks. Building on the Prompt Tuning approach of Lester et al. (2021), which learns task-specific soft prompts to condition a frozen pre-trained model to perform different tasks, we propose a novel prompt-based transfer learning approach called SPoT: Soft Prompt Transfer. SPoT first le…

    Submitted 16 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Accepted as a main conference paper at ACL 2022, 21 pages, 8 figures, 7 tables

  14. arXiv:2108.08877  [pdf, other]

    cs.CL

    Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models

    Authors: Jianmo Ni, Gustavo Hernández Ábrego, Noah Constant, Ji Ma, Keith B. Hall, Daniel Cer, Yinfei Yang

    Abstract: We provide the first exploration of sentence embeddings from text-to-text transformers (T5). Sentence embeddings are broadly useful for language processing tasks. While T5 achieves impressive performance on language tasks cast as sequence-to-sequence mapping problems, it is unclear how to produce sentence embeddings from encoder-decoder models. We investigate three methods for extracting T5 senten…

    Submitted 14 December, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

  15. arXiv:2107.14749  [pdf, other]

    cs.CL

    Towards Universality in Multilingual Text Rewriting

    Authors: Xavier Garcia, Noah Constant, Mandy Guo, Orhan Firat

    Abstract: In this work, we take the first steps towards building a universal rewriter: a model capable of rewriting text in any language to exhibit a wide variety of attributes, including styles and languages, while preserving as much of the original semantics as possible. In addition to obtaining state-of-the-art results on unsupervised translation, we also demonstrate the ability to do zero-shot sentiment…

    Submitted 30 July, 2021; originally announced July 2021.

  16. arXiv:2106.02171  [pdf, other]

    cs.CL

    nmT5 -- Is parallel data still relevant for pre-training massively multilingual language models?

    Authors: Mihir Kale, Aditya Siddhant, Noah Constant, Melvin Johnson, Rami Al-Rfou, Linting Xue

    Abstract: Recently, mT5 - a massively multilingual version of T5 - leveraged a unified text-to-text format to attain state-of-the-art results on a wide variety of multilingual NLP tasks. In this paper, we investigate the impact of incorporating parallel data into mT5 pre-training. We find that multi-tasking language modeling with objectives such as machine translation during pre-training is a straightforwar…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: Accepted at ACL-IJCNLP 2021

  17. arXiv:2105.13626  [pdf, other]

    cs.CL

    ByT5: Towards a token-free future with pre-trained byte-to-byte models

    Authors: Linting Xue, Aditya Barua, Noah Constant, Rami Al-Rfou, Sharan Narang, Mihir Kale, Adam Roberts, Colin Raffel

    Abstract: Most widely-used pre-trained language models operate on sequences of tokens corresponding to word or subword units. By comparison, token-free models that operate directly on raw text (bytes or characters) have many benefits: they can process text in any language out of the box, they are more robust to noise, and they minimize technical debt by removing complex and error-prone text preprocessing pi…

    Submitted 7 March, 2022; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: To be published in TACL 2022

  18. arXiv:2104.08691  [pdf, other]

    cs.CL

    The Power of Scale for Parameter-Efficient Prompt Tuning

    Authors: Brian Lester, Rami Al-Rfou, Noah Constant

    Abstract: In this work, we explore "prompt tuning", a simple yet effective mechanism for learning "soft prompts" to condition frozen language models to perform specific downstream tasks. Unlike the discrete text prompts used by GPT-3, soft prompts are learned through backpropagation and can be tuned to incorporate signal from any number of labeled examples. Our end-to-end learned approach outperforms GPT-3'…

    Submitted 2 September, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: Accepted to EMNLP 2021

  19. arXiv:2104.07412  [pdf, other]

    cs.CL cs.AI

    XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation

    Authors: Sebastian Ruder, Noah Constant, Jan Botha, Aditya Siddhant, Orhan Firat, Jinlan Fu, Pengfei Liu, Junjie Hu, Dan Garrette, Graham Neubig, Melvin Johnson

    Abstract: Machine learning has brought striking advances in multilingual natural language processing capabilities over the past year. For example, the latest techniques have improved the state-of-the-art performance on the XTREME multilingual benchmark by more than 13 points. While a sizeable gap to human-level performance remains, improvements have been easier to achieve in some tasks than in others. This…

    Submitted 7 October, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 camera-ready

  20. arXiv:2103.06799  [pdf, other]

    cs.CL

    Towards Continual Learning for Multilingual Machine Translation via Vocabulary Substitution

    Authors: Xavier Garcia, Noah Constant, Ankur P. Parikh, Orhan Firat

    Abstract: We propose a straightforward vocabulary adaptation scheme to extend the language capacity of multilingual machine translation models, paving the way towards efficient continual learning for multilingual machine translation. Our approach is suitable for large-scale datasets, applies to distant languages with unseen scripts, incurs only minor degradation on the translation performance for the origin…

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted at NAACL 2021

  21. arXiv:2010.12008  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Zero-Shot Multilingual Synthetic Question and Answer Generation for Cross-Lingual Reading Comprehension

    Authors: Siamak Shakeri, Noah Constant, Mihir Sanjay Kale, Linting Xue

    Abstract: We propose a simple method to generate multilingual question and answer pairs on a large scale through the use of a single generative model. These synthetic samples can be used to improve the zero-shot performance of multilingual QA models on target languages. Our proposed multi-task training of the generative model only requires the labeled training samples in English, thus removing the need for…

    Submitted 28 May, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

  22. arXiv:2010.11934  [pdf, other]

    cs.CL

    mT5: A massively multilingual pre-trained text-to-text transformer

    Authors: Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-Rfou, Aditya Siddhant, Aditya Barua, Colin Raffel

    Abstract: The recent "Text-to-Text Transfer Transformer" (T5) leveraged a unified text-to-text format and scale to attain state-of-the-art results on a wide variety of English-language NLP tasks. In this paper, we introduce mT5, a multilingual variant of T5 that was pre-trained on a new Common Crawl-based dataset covering 101 languages. We detail the design and modified training of mT5 and demonstrate its s…

    Submitted 11 March, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

  23. arXiv:2010.03802  [pdf, other]

    cs.CL cs.LG

    TextSETTR: Few-Shot Text Style Extraction and Tunable Targeted Restyling

    Authors: Parker Riley, Noah Constant, Mandy Guo, Girish Kumar, David Uthus, Zarana Parekh

    Abstract: We present a novel approach to the problem of text style transfer. Unlike previous approaches requiring style-labeled training data, our method makes use of readily-available unlabeled text by relying on the implicit connection in style between adjacent sentences, and uses labeled data only at inference time. We adapt T5 (Raffel et al., 2020), a strong pretrained text-to-text model, to extract a s…

    Submitted 23 June, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

  24. arXiv:2005.02507  [pdf, other]

    cs.CL cs.LG

    MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models

    Authors: Mandy Guo, Yinfei Yang, Daniel Cer, Qinlan Shen, Noah Constant

    Abstract: Retrieval question answering (ReQA) is the task of retrieving a sentence-level answer to a question from an open corpus (Ahmad et al., 2019). This paper presents MultiReQA, a new multi-domain ReQA evaluation suite composed of eight retrieval QA tasks drawn from publicly available QA datasets. We provide the first systematic retrieval based evaluation over these datasets using two supervised neural m…

    Submitted 5 May, 2020; originally announced May 2020.

  25. arXiv:2004.05484  [pdf, other]

    cs.CL cs.LG

    LAReQA: Language-agnostic answer retrieval from a multilingual pool

    Authors: Uma Roy, Noah Constant, Rami Al-Rfou, Aditya Barua, Aaron Phillips, Yinfei Yang

    Abstract: We present LAReQA, a challenging new benchmark for language-agnostic answer retrieval from a multilingual candidate pool. Unlike previous cross-lingual tasks, LAReQA tests for "strong" cross-lingual alignment, requiring semantically related cross-language pairs to be closer in representation space than unrelated same-language pairs. Building on multilingual BERT (mBERT), we study different strateg…

    Submitted 11 April, 2020; originally announced April 2020.

  26. arXiv:1908.10322  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Bridging the Gap for Tokenizer-Free Language Models

    Authors: Dokook Choe, Rami Al-Rfou, Mandy Guo, Heeyoung Lee, Noah Constant

    Abstract: Purely character-based language models (LMs) have been lagging in quality on large scale datasets, and current state-of-the-art LMs rely on word tokenization. It has been assumed that injecting the prior knowledge of a tokenizer into the model is essential to achieving competitive results. In this paper, we show that contrary to this conventional wisdom, tokenizer-free LMs with sufficient capacity…

    Submitted 27 August, 2019; originally announced August 2019.

  27. ReQA: An Evaluation for End-to-End Answer Retrieval Models

    Authors: Amin Ahmad, Noah Constant, Yinfei Yang, Daniel Cer

    Abstract: Popular QA benchmarks like SQuAD have driven progress on the task of identifying answer spans within a specific passage, with models now surpassing human performance. However, retrieving relevant answers from a huge corpus of documents is still a challenging problem, and places different requirements on the model architecture. There is growing interest in developing scalable answer retrieval model…

    Submitted 3 October, 2019; v1 submitted 10 July, 2019; originally announced July 2019.

  28. arXiv:1907.04307  [pdf, other]

    cs.CL

    Multilingual Universal Sentence Encoder for Semantic Retrieval

    Authors: Yinfei Yang, Daniel Cer, Amin Ahmad, Mandy Guo, Jax Law, Noah Constant, Gustavo Hernandez Abrego, Steve Yuan, Chris Tar, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: We introduce two pre-trained retrieval focused multilingual sentence encoding models, respectively based on the Transformer and CNN model architectures. The models embed text from 16 languages into a single semantic space using a multi-task trained dual-encoder that learns tied representations using translation based bridge tasks (Chidambaram et al., 2018). The models provide performance that is comp…

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: 6 pages, 6 tables, 2 listings, and 1 figure

  29. arXiv:1808.04444  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Character-Level Language Modeling with Deeper Self-Attention

    Authors: Rami Al-Rfou, Dokook Choe, Noah Constant, Mandy Guo, Llion Jones

    Abstract: LSTMs and other RNN variants have shown strong performance on character-level language modeling. These models are typically trained using truncated backpropagation through time, and it is common to assume that their success stems from their ability to remember long-term contexts. In this paper, we show that a deep (64-layer) transformer model with fixed context outperforms RNN variants by a large…

    Submitted 10 December, 2018; v1 submitted 9 August, 2018; originally announced August 2018.

    Comments: 8 pages, 7 figures

  30. arXiv:1807.11906  [pdf, other]

    cs.CL

    Effective Parallel Corpus Mining using Bilingual Sentence Embeddings

    Authors: Mandy Guo, Qinlan Shen, Yinfei Yang, Heming Ge, Daniel Cer, Gustavo Hernandez Abrego, Keith Stevens, Noah Constant, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: This paper presents an effective approach for parallel corpus mining using bilingual sentence embeddings. Our embedding models are trained to produce similar representations exclusively for bilingual sentence pairs that are translations of each other. This is achieved using a novel training method that introduces hard negatives consisting of sentences that are not translations but that have some d…

    Submitted 2 August, 2018; v1 submitted 31 July, 2018; originally announced July 2018.

  31. arXiv:1804.07754  [pdf, other]

    cs.CL

    Learning Semantic Textual Similarity from Conversations

    Authors: Yinfei Yang, Steve Yuan, Daniel Cer, Sheng-yi Kong, Noah Constant, Petr Pilar, Heming Ge, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: We present a novel approach to learn representations for sentence-level semantic similarity using conversational data. Our method trains an unsupervised model to predict conversational input-response pairs. The resulting sentence embeddings perform well on the semantic textual similarity (STS) benchmark and SemEval 2017's Community Question Answering (CQA) question similarity subtask. Performance…

    Submitted 20 April, 2018; originally announced April 2018.

    Comments: 10 pages, 8 Figures, 6 Tables

  32. arXiv:1803.11175  [pdf, other]

    cs.CL

    Universal Sentence Encoder

    Authors: Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Cespedes, Steve Yuan, Chris Tar, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity, r…

    Submitted 12 April, 2018; v1 submitted 29 March, 2018; originally announced March 2018.

    Comments: 7 pages; fixed module URL in Listing 1

  33. arXiv:1712.09347  [pdf, other]

    cs.CY

    Smart Fog: Fog Computing Framework for Unsupervised Clustering Analytics in Wearable Internet of Things

    Authors: Debanjan Borthakur, Harishchandra Dubey, Nicholas Constant, Leslie Mahler, Kunal Mankodiya

    Abstract: The increasing use of wearables in smart telehealth generates heterogeneous medical big data. Cloud and fog services process these data for assisting clinical procedures. IoT based ehealthcare have greatly benefited from efficient data processing. This paper proposed and evaluated use of low resource machine learning on Fog devices kept close to the wearables for smart healthcare. In state of the…

    Submitted 24 December, 2017; originally announced December 2017.

    Comments: 5 pages, 3 figures. 5th IEEE Global Conference on Signal and Information Processing GlobalSIP 2017

  34. Fog Computing in Medical Internet-of-Things: Architecture, Implementation, and Applications

    Authors: Harishchandra Dubey, Admir Monteiro, Nicholas Constant, Mohammadreza Abtahi, Debanjan Borthakur, Leslie Mahler, Yan Sun, Qing Yang, Umer Akbar, Kunal Mankodiya

    Abstract: In the era when the market segment of Internet of Things (IoT) tops the chart in various business reports, it is apparently envisioned that the field of medicine expects to gain a large benefit from the explosion of wearables and internet-connected sensors that surround us to acquire and communicate unprecedented data on symptoms, medication, food intake, and daily-life activities impacting one's…

    Submitted 24 June, 2017; originally announced June 2017.

    Comments: 29 pages, 30 figures, 5 tables. Keywords: Big Data, Body Area Network, Body Sensor Network, Edge Computing, Fog Computing, Medical Cyberphysical Systems, Medical Internet-of-Things, Telecare, Tele-treatment, Wearable Devices, Chapter in Handbook of Large-Scale Distributed Computing in Smart Healthcare (2017), Springer

  35. arXiv:1701.08680  [pdf, other]

    cs.DC cs.CY cs.NI

    Fog-Assisted wIoT: A Smart Fog Gateway for End-to-End Analytics in Wearable Internet of Things

    Authors: Nicholas Constant, Debanjan Borthakur, Mohammadreza Abtahi, Harishchandra Dubey, Kunal Mankodiya

    Abstract: Today, wearable internet-of-things (wIoT) devices continuously flood the cloud data centers at an enormous rate. This increases a demand to deploy an edge infrastructure for computing, intelligence, and storage close to the users. The emerging paradigm of fog computing could play an important role to make wIoT more efficient and affordable. Fog computing is known as the cloud on the ground. This p…

    Submitted 24 January, 2017; originally announced January 2017.

    Comments: 5 pages, 4 figures, The 23rd IEEE Symposium on High Performance Computer Architecture HPCA 2017, (Feb. 4, 2017 - Feb. 8, 2017), Austin, Texas, USA

  36. Fog Data: Enhancing Telehealth Big Data Through Fog Computing

    Authors: Harishchandra Dubey, Jing Yang, Nick Constant, Amir Mohammad Amiri, Qing Yang, Kunal Makodiya

    Abstract: The size of multi-modal, heterogeneous data collected through various sensors is growing exponentially. It demands intelligent data reduction, data mining and analytics at edge devices. Data compression can reduce the network bandwidth and transmission power consumed by edge devices. This paper proposes, validates and evaluates Fog Data, a service-oriented architecture for Fog computing. The cente…

    Submitted 1 June, 2016; v1 submitted 30 May, 2016; originally announced May 2016.

    Comments: 6 pages, 4 figures in ASE BD&SI '15 Proceedings of the ASE BigData & SocialInformatics 2015, ACM, NY