Like everyone else in the space, I’ve been asked a lot about DeepSeek, and I wanted to share my initial thoughts and observations.
The Power of Open-Source
It’s great to see the DeepSeek team open-sourcing their model and publishing their work for others to review. One of the main reasons the ML community has been able to flourish is its ability to collaborate and share ideas. However, with the sheer amount of investment flowing into AI, this openness appeared to be gradually shifting behind the closed doors of select companies. Yet, Computer Science always finds a way to optimize and make things more cost-efficient.
Understanding the True Costs
There’s a narrative circulating about the $6M cost of DeepSeek’s model, but it’s important to clarify that this reported figure represents the estimated compute cost. It does not account for associated staff, discarded training runs, or the thousands of GPU cards used in the process.
Testing Matters
Whether you’re considering downloading the model or using the hosted version, it’s important to thoroughly red team and penetration test it before introducing it into your organization. It’s no different from downloading any other software from the internet or uploading sensitive documents to a website.
Geopolitical Stakes and the Global AI Race
There has been discussion about the geopolitical implications, especially given the massive investments coming from the US and China. Recent shifts in US tech stocks highlight the stakes at play. Both countries have committed substantial funding and have access to vast talent pools. That said, it’s worth noting that many other nations, including the UK, are pursuing their own AI strategies, with varying levels of government intervention or support, depending on your perspective.
And yes, you can use Mindgard to test DeepSeek. Interested? Let’s set up time to chat - https://mindgard.ai/demo