Activity
-
When we launched Command R+, we grew a fanbase in Japan with our SOTA multilingual capabilities. Today, I'm excited to share that we're doubling…
Shared by Ivan Zhang
-
I'm thrilled to announce that I've recently been promoted to Grandmaster tier in League of Legends, placing me among the top 1000 players in North…
Liked by Ivan Zhang
-
Honored to attend the Edge Security Summit hosted by GBEF. Gaining valuable insights on security, emerging tech, AI, and our collective roles in…
Liked by Ivan Zhang
Publications
-
Targeted Dropout
NIPS CDNNRIA Workshop
Neural networks are extremely flexible models due to their large number of parameters, which is beneficial for learning, but also highly redundant. This makes it possible to compress neural networks without having a drastic effect on performance. We introduce targeted dropout, a strategy for post hoc pruning of neural network weights and units that builds the pruning mechanism directly into learning. At each weight update, targeted dropout selects a candidate set for pruning using a simple selection criterion, and then stochastically prunes the network via dropout applied to this set. The resulting network learns to be explicitly robust to pruning, comparing favourably to more complicated regularization schemes while at the same time being extremely simple to implement, and easy to tune.
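The selection-then-dropout step described in the abstract can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not name the selection criterion, so weight magnitude is assumed here, and the names `targeted_dropout`, `target_frac`, and `drop_prob` are hypothetical.

```python
import random


def targeted_dropout(weights, target_frac, drop_prob, rng):
    """Return a copy of `weights` with some low-magnitude entries zeroed.

    Assumption: the candidate set is the `target_frac` fraction of weights
    with the smallest absolute value. Each candidate is then dropped
    independently with probability `drop_prob`.
    """
    n_candidates = int(len(weights) * target_frac)
    # Indices ordered by magnitude, smallest first.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    candidates = order[:n_candidates]

    out = list(weights)
    for i in candidates:
        # Stochastic pruning: dropout applied only to the candidate set.
        if rng.random() < drop_prob:
            out[i] = 0.0
    return out


weights = [0.9, -0.05, 0.4, 0.01, -0.7, 0.2]
masked = targeted_dropout(weights, target_frac=0.5, drop_prob=0.5,
                          rng=random.Random(0))
```

In this sketch, weights outside the candidate set are never touched, so only the entries already likely to be pruned are exposed to dropout during training.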
-
Unsupervised Cipher Cracking Using Discrete GANs
arXiv
This work details CipherGAN, an architecture inspired by CycleGAN used for inferring the underlying cipher mapping given banks of unpaired ciphertext and plaintext. We demonstrate that CipherGAN is capable of cracking language data enciphered using shift and Vigenere ciphers to a high degree of fidelity and for vocabularies much larger than previously achieved. We present how CycleGAN can be made compatible with discrete data and train in a stable way. We then prove that the technique used in CipherGAN avoids the common problem of uninformative discrimination associated with GANs applied to discrete data.
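For readers unfamiliar with the two cipher families named above, here is a minimal sketch of how shift and Vigenere encipherment work. It assumes a 26-letter uppercase alphabet purely for illustration; the function names are hypothetical, and nothing here reflects the vocabulary or data pipeline used in the publication.

```python
import string

# Assumption for illustration: a 26-symbol uppercase alphabet.
ALPHABET = string.ascii_uppercase


def vigenere_encipher(plaintext, key):
    """Shift each symbol by the amount given by the repeating key."""
    out = []
    for pos, ch in enumerate(plaintext):
        shift = ALPHABET.index(key[pos % len(key)])
        out.append(ALPHABET[(ALPHABET.index(ch) + shift) % len(ALPHABET)])
    return "".join(out)


def shift_encipher(plaintext, shift):
    """A shift cipher is a Vigenere cipher with a one-symbol key."""
    return vigenere_encipher(plaintext, ALPHABET[shift % len(ALPHABET)])


print(shift_encipher("HELLO", 3))                   # KHOOR
print(vigenere_encipher("ATTACKATDAWN", "LEMON"))   # LXFOPVEFRNHR
```

Cracking such a cipher means recovering this mapping from ciphertext alone; the sketch covers only the forward (enciphering) direction.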
More activity by Ivan
-
Cohere co-founder and CTO, Ivan Zhang joined FEMA CIO Charles R. Armstrong for a dialogue on Cloud Security in the Era of #AI. GVP Peter Guerra from…
Liked by Ivan Zhang
-
Shipd by Datacurve (https://shipd.datacurve.ai) literally pays you to solve coding problems - it's every CS nerd's dream. We're excited that over…
Liked by Ivan Zhang
-
Bowen Yang from Cohere will dive into the world of long-context models and how we can push the boundaries of transformer architectures. At TMLS2024…
Liked by Ivan Zhang
-
AI Demo Days #2 is happening!! Thanks to Maxime Voisin and Ivan Zhang for hosting at Cohere offices. Looking forward to seeing the latest and…
Liked by Ivan Zhang
-
Just merged a PR to reduce our GPU usage so Taylor could fly private 170 times a year.
Liked by Ivan Zhang
-
Our team has crafted ready-made guides and notebooks, now easily accessible through our brand-new cookbook library! Dive into step-by-step…
Liked by Ivan Zhang
-
I'm excited to share that I'm continuing my journey at Cohere as an Engineering Program Manager Intern starting in the London, UK office! Grateful…
Liked by Ivan Zhang
-
I had the pleasure of speaking at Microsoft Build on how to improve retrieval augmented generation (RAG) systems. While it was too short to go into…
Liked by Ivan Zhang