GitHub - exo-explore/exo: Run your own AI cluster at home with everyday

Several videos on running on apple hardware:

And Perplexity’s summary.

Apple MLX

It appears the above demos uses Apple MLX.

  • Unified memory: A notable difference from MLX and other frameworks is the unified memory model. Arrays in MLX live in shared memory. Operations on MLX arrays can be performed on any of the supported device types without transferring data.