Good Ol' LSTM Thinking Steps

by TimeLordRaps - opened Jun 14

Do you remember how LSTMs can be fed back through themselves for extra "thinking steps"? How would I go about adapting this model for test-time-aware thinking steps, where the model is meta-aware of its own reasoning needs, i.e. it makes a pre-estimate of how many thinking steps it will need? It would be useful to augment the xLSTM LM with reasoning-strength estimation, or to create a general adapter that produces reasoning-estimation statistics. Happy researching, love the model.
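To make the idea concrete, here is a rough sketch of what I mean, in plain Python with a toy LSTM cell and random, untrained weights. It is not xLSTM and not anything from this repo: `TinyLSTMCell`, `estimate_steps`, `think`, and the `head_weights` vector are all hypothetical names I made up for illustration. The cell reads the input once, a small head turns the hidden state into a step budget, and then the hidden state is fed back in as the input for that many steps.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTMCell:
    """Toy LSTM cell with random (untrained) weights. Illustration only."""

    def __init__(self, size, seed=0):
        rng = random.Random(seed)
        self.size = size

        def mat():
            # One row per unit over the concatenated [x, h, 1] vector.
            return [[rng.uniform(-0.5, 0.5) for _ in range(2 * size + 1)]
                    for _ in range(size)]

        self.W = {gate: mat() for gate in ("i", "f", "o", "g")}

    def _affine(self, gate, xh):
        return [sum(w * v for w, v in zip(row, xh)) for row in self.W[gate]]

    def step(self, x, h, c):
        xh = x + h + [1.0]  # the trailing 1.0 plays the role of a bias input
        i = [sigmoid(v) for v in self._affine("i", xh)]
        f = [sigmoid(v) for v in self._affine("f", xh)]
        o = [sigmoid(v) for v in self._affine("o", xh)]
        g = [math.tanh(v) for v in self._affine("g", xh)]
        c_new = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
        h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]
        return h_new, c_new


def estimate_steps(h, head_weights, max_steps):
    """Hypothetical budget head: hidden state -> integer in [1, max_steps]."""
    score = sigmoid(sum(w * v for w, v in zip(head_weights, h)))
    return 1 + round(score * (max_steps - 1))


def think(cell, x, head_weights, max_steps=8):
    h = [0.0] * cell.size
    c = [0.0] * cell.size
    h, c = cell.step(x, h, c)  # read the real input once
    # Pre-estimate the number of thinking steps before doing any of them.
    budget = estimate_steps(h, head_weights, max_steps)
    for _ in range(budget):
        h, c = cell.step(h, h, c)  # feed the hidden state back in as the input
    return h, budget


cell = TinyLSTMCell(size=4)
h, budget = think(cell, [0.1, 0.2, 0.3, 0.4], head_weights=[0.5] * 4)
print("estimated thinking steps:", budget)
```

In a real version the budget head would need a training signal, for example a penalty on the number of steps so the model learns to ask for more only when it helps. That part is left open here.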
