Two embedding clouds rotating into alignment, with English/Italian word pairs

Word Translation Without Parallel Data: A Beautiful Application of GANs and Linear Algebra

Three beauties meet in this paper. Words in one language end up internally aligned by meaning. For languages with shared structure, that arrangement is shaped almost the same way across both. A GAN and an SVD recover the rotation between them, strongly for close pairs, weakly for distant ones, with no parallel text.

May 26, 2026 · 24 min
Universal Sentence Encoder Visualization

Visualizing Context with Google's Universal Sentence Encoder and GraphDB

An interactive exploration of how Google’s Universal Sentence Encoder captures semantic context, visualized using graph databases. Read the full article on Medium → Originally published on Medium’s Data Science publication. This is one of my favorite articles showcasing the power of semantic embeddings through visual exploration.

September 1, 2024 · 1 min
DSPy Hard Negatives Optimization

DSPy in Action: Improving Optimizers Through HardNegatives

Exploring how to improve DSPy optimizers using hard negative examples to enhance model performance. Read the full article on Medium → Originally published on Medium’s Data Science publication.

August 15, 2024 · 1 min
PyTorch Custom Datasets for Text and Machine Translation

Custom Datasets in PyTorch (Part 2): Text & Machine Translation

Learn how to create custom datasets in PyTorch for text and machine translation tasks with practical examples. Read the full article on Medium → Originally published on Medium’s Data Science publication.

May 20, 2023 · 1 min