Llama 3.3 70B offers similar performance compared to Llama 3.1 405B model. This model also requires less VRAM, e.g. works well on M4 64GB with a speed of 10 tokens/s. https://ollama.com/library/llama3.3 Right…
Month: December 2024
Chat from LangChain
This is a significantly different RAG we used to think of: https://chat.langchain.com/. Take a look at the sample questions and observe how agents from LangChain and LangGraph utilize them to construct responses….
Genie 2, DeepThought-8B, snowflake-arctic-embed2
Google has recently released Genie 2, their most advanced large-scale foundation model capable of generating consistent and playable worlds for a minute. This advancement could be particularly beneficial for game developers, but…