HMT: Hierarchical Memory Transformer for Long Context Language Processing

Scout Monitoring - Free Django app performance insights with Scout Monitoring

Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

www.scoutapm.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

HMT-pytorch

1 38 8.1 Python

Official Implementation of "HMT: Hierarchical Memory Transformer for Long Context Language Processing"

Code: https://github.com/OswaldHe/HMT-pytorch
This looks really interesting. I've the paper to my reading list and look forward to playing with the code. I'm curious to see what kinds of improvements we can get by agumenting Transformers and other generative language/sequence models with this and other mechanisms implementing hierarchical memory.[a]
We sure live in interesting times!
---
[a] In the past, I experimented a little with transformers that had access to external memory using https://github.com/lucidrains/memorizing-transformers-pytorc... and also using routed queries with https://github.com/glassroom/heinsen_routing . Both approaches seemed to work, but I never attempted to build any kind of hierarchy with those approaches.

memorizing-transformers-pytorch

6 614 2.6 Python

Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch

Code: https://github.com/OswaldHe/HMT-pytorch
This looks really interesting. I've the paper to my reading list and look forward to playing with the code. I'm curious to see what kinds of improvements we can get by agumenting Transformers and other generative language/sequence models with this and other mechanisms implementing hierarchical memory.[a]
We sure live in interesting times!
---
[a] In the past, I experimented a little with transformers that had access to external memory using https://github.com/lucidrains/memorizing-transformers-pytorc... and also using routed queries with https://github.com/glassroom/heinsen_routing . Both approaches seemed to work, but I never attempted to build any kind of hierarchy with those approaches.

Scout Monitoring

www.scoutapm.com featured

Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
memorizing-transformers-pytorc

3 - -

Code: https://github.com/OswaldHe/HMT-pytorch
This looks really interesting. I've the paper to my reading list and look forward to playing with the code. I'm curious to see what kinds of improvements we can get by agumenting Transformers and other generative language/sequence models with this and other mechanisms implementing hierarchical memory.[a]
We sure live in interesting times!
---
[a] In the past, I experimented a little with transformers that had access to external memory using https://github.com/lucidrains/memorizing-transformers-pytorc... and also using routed queries with https://github.com/glassroom/heinsen_routing . Both approaches seemed to work, but I never attempted to build any kind of hierarchy with those approaches.

heinsen_routing

8 160 2.7 Python

Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All Domains" (Heinsen, 2019), for composing deep neural networks.

Code: https://github.com/OswaldHe/HMT-pytorch
This looks really interesting. I've the paper to my reading list and look forward to playing with the code. I'm curious to see what kinds of improvements we can get by agumenting Transformers and other generative language/sequence models with this and other mechanisms implementing hierarchical memory.[a]
We sure live in interesting times!
---
[a] In the past, I experimented a little with transformers that had access to external memory using https://github.com/lucidrains/memorizing-transformers-pytorc... and also using routed queries with https://github.com/glassroom/heinsen_routing . Both approaches seemed to work, but I never attempted to build any kind of hierarchy with those approaches.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

What can LLMs never do?

4 projects | news.ycombinator.com | 27 Apr 2024
x-transformers

1 project | news.ycombinator.com | 31 Mar 2024
Large Language Models for Compiler Optimization

3 projects | news.ycombinator.com | 17 Sep 2023
The Eleuther AI Mafia

2 projects | news.ycombinator.com | 3 Sep 2023
From Deep to Long Learning

6 projects | news.ycombinator.com | 9 Apr 2023

HMT: Hierarchical Memory Transformer for Long Context Language Processing

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Deep Learning Artificial intelligence em-routing capsule-network attention-mechanism
Post date: 17 May 2024

HMT-pytorch

memorizing-transformers-pytorch

Scout Monitoring

memorizing-transformers-pytorc

heinsen_routing

Related posts

What can LLMs never do?

x-transformers

Large Language Models for Compiler Optimization

The Eleuther AI Mafia

From Deep to Long Learning

HMT: Hierarchical Memory Transformer for Long Context Language Processing

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Deep Learning Artificial intelligence em-routing capsule-network attention-mechanism Post date: 17 May 2024

HMT-pytorch

memorizing-transformers-pytorch

Scout Monitoring

memorizing-transformers-pytorc

heinsen_routing

Related posts

What can LLMs never do?

x-transformers

Large Language Models for Compiler Optimization

The Eleuther AI Mafia

From Deep to Long Learning

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Deep Learning Artificial intelligence em-routing capsule-network attention-mechanism
Post date: 17 May 2024