Profile Picture
  • All
  • Search
  • Images
  • Videos
    • Shorts
  • Maps
  • News
  • More
    • Shopping
    • Flights
    • Travel
  • Notebook
Report an inappropriate content
Please select one of the options below.
Kiabi
Deepcool
CES 2023
IA
Orbi
SK
CES 2023
What's New at
CES 2023
Forvia
CES 2023
Where Is
CES 2023
Cenace
2023
Podcast Engadget
You Got the
CES
Aptiv 2023 CES 2023
Trunk Open
BMW I Vision Dee
New Air Coolers
CES
Las Vegas Show 2016 Video
Laptop MSI
Aptiv 2023 CES 2023
Trunk
CES
Las Vegas Shhow 2026 Video
Linus Tech
CES
2026 and Iot
Ceslasveggasshhowvideo2026
Air Cooler Fan
Create TV
2023
CES
Crease Less Foldables
Fast Lane to Vegas
CES 2023
Location
Best New
CES 2023
Splunk MSI Final Update
MSI Laptop
CES 2023
John Dick
  • Length
    AllShort (less than 5 minutes)Medium (5-20 minutes)Long (more than 20 minutes)
  • Date
    AllPast 24 hoursPast weekPast monthPast year
  • Resolution
    AllLower than 360p360p or higher480p or higher720p or higher1080p or higher
  • Source
    All
    Dailymotion
    Vimeo
    Metacafe
    Hulu
    VEVO
    Myspace
    MTV
    CBS
    Fox
    CNN
    MSN
  • Price
    AllFreePaid
  • Clear filters
  • SafeSearch:
  • Moderate
    StrictModerate (default)Off
Filter
    Kiabi
    Deepcool
    CES 2023
    IA
    Orbi
    SK
    CES 2023
    What's New at
    CES 2023
    Forvia
    CES 2023
    Where Is
    CES 2023
    Cenace
    2023
    Podcast Engadget
    You Got the
    CES
    Aptiv 2023 CES 2023
    Trunk Open
    BMW I Vision Dee
    New Air Coolers
    CES
    Las Vegas Show 2016 Video
    Laptop MSI
    Aptiv 2023 CES 2023
    Trunk
    CES
    Las Vegas Shhow 2026 Video
    Linus Tech
    CES
    2026 and Iot
    Ceslasveggasshhowvideo2026
    Air Cooler Fan
    Create TV
    2023
    CES
    Crease Less Foldables
    Fast Lane to Vegas
    CES 2023
    Location
    Best New
    CES 2023
    Splunk MSI Final Update
    MSI Laptop
    CES 2023
    John Dick
You now convert any LLM into a faster one without retraining from scratch.NVIDIA just did this to their 30B model. Here's the trick:1. Duplicate the model into two copies2. Freeze one copy, it just reads the prompt and remembers context3. Train the other copy to write chunks of text at once instead of one word at a time4. Run them togetherThe frozen copy barely costs anything (it's already trained). The new copy only needed ~8% of the original training data to learn the new trick.Result: 2.4x fa
0:13
You now convert any LLM into a faster one without retraining from scratch.NVIDIA just did this to their 30B model. Here's the trick:1. Duplicate the model into two copies2. Freeze one copy, it just reads the prompt and remembers context3. Train the other copy to write chunks of text at once instead of one word at a time4. Run them togetherThe frozen copy barely costs anything (it's already trained). The new copy only needed ~8% of the original training data to learn the new trick.Result: 2.4x fa
103.4K views1 day ago
x.comLior Alexander
See more
Static thumbnail place holder
More like this
  • Privacy
  • Terms