Claude Opus 4 launches and presents it as the best programming model in the world

After Google will display all its artillery in artificial intelligenceAnthropic did not want to be left behind. The company founded by Dario Amodei has moved tab strongly: has presented Claude Opus 4 and Claude Sonnet 4two new models with which he aspires to leave his mark on the race for AI. The announcement star is Claude Opus 4, the most advanced model Anthropic has developed so far. And they do not walk with Rodeos: they assure that it is “the best programming model in the world. An ambitious statement that, as always, will have to be tested. But the first data places it very well positioned in front of its main rivals. In the benchmark Swe-Bench Verified, which evaluates real software engineering tasks, Opus 4 gets 72.5 % in standard conditions and reaches 79.4 % if the Parallel processing. It is a performance that leaves it above models such as GPT-4.1 (54.6 %), O3 (69.1 %) or the recent Gemini 2.5 Pro of Google (63.2 %). However, in other more demanding evidence in multimodal reasoning, such as GPQA Diamond or MMMU, focused on university level questions and complex scenarios that combine text and image, Opus 4 fails to overcome O3, which continues to lead in that field. A model with resistance and autonomy But beyond the numbers, what Anthropic wants to highlight is the resistance and autonomy of this model. Claude Opus 4 is capable of maintaining long work sessions and executing thousands of steps continuously. From the company they explain that this makes it an ideal basis for AI agents More sophisticated: systems that make decisions, complete tasks on their own and do not need constant human supervision. In parallel arrives Claude Sonnet 4, an evolution of the model that Anthropic launched in February. It is not intended to compete with Power Opus, but it offers a very balanced proposal between performance and efficiency. In coding it also makes an important leap with respect to its previous version: it goes from 62.3 % to 72.7 % in Swe-Bench Verified, and improves in reasoning tasks, instructions monitoring and general precision. Both models arrive with interesting news. For example, they can now alternate between reasoning and use of tools Within the same process, which allows more complete answers. They have also improved in reliability. According to Anthropic, they are 65 % less likely to take shortcuts or make serious mistakes than Sonnet 3.7. Claude Opus 4 and Sonnet 4 are already available in the API of Anthropic, at Amazon Bedrock and Google Cloud Vertex AI. They are included in the Pro, Max, Team and Enterprise plans. Prices are kept in the line of the previous models: Opus 4 costs $ 15 per million input tokens and 75 per million departure tokens. Sonnet 4 is more affordable: 3 and 15 dollars respectively. The latter can also be used from free accounts. Images | Anthropic In Xataka | We have tried the new Google AI mode: it is a direct bullet to the blue links that worries and excites in equal parts

Claude joins the list of attendees who challenge Google with updated responses

During the last decades, Google got so much in our lives that we ended up adopting the verb “google” as a synonym for looking for information. Phrases like “Give me a second to google” or “Did you google?” They became part of the day to day. But things are changing from the hand of artificial intelligence (AI). A giant who begins to give ground. Google is no longer the only access door to information. More and more users resort to attendees promoted by artificial intelligence (AI) to find answers. And that is where Perplexity, Chatgpt and now Claude come into play, who are gaining strength. Why we should pay attention. That a chatbot of AI can search the website makes the difference. Without that capacity, their answers are based only on the data with which it was trained, which can leave out key information or make it be left behind in the face of recent changes. Claude now search on the web. Anthropic’s chatbot is already able to obtain real -time internet information. In addition, it includes direct appointments so you can check the sources for yourself. Instead of showing only results, Claude processes them and presents the information in a conversational format easier to understand. Use scenarios. Now that Claude can search the web, the question is clear: in what cases does this function make the difference? Its creators have identified several key uses in which Internet access improves experience with chatbot, although the real limit is put by users and their needs. Account transformation and improvement in success rates in sales equipment through more informed conversations with customers, based on the analysis of the sector trends and the identification of key points. Market data assessment in real time, profits and tendencies of the sector to optimize investment decisions and improve the accuracy of financial models. Preparation of subsidy proposals and more solid bibliographic reviews thanks to access to primary sources on the web, the detection of emerging trends and the identification of lagoons in current research. Detailed comparison of products, prices and reviews in multiple sources to make more founded purchase decisions. At the moment, only in the US. The search function on the Claude website is now available for payment users in the United States. There is no confirmed date for its arrival in Europe. His arrival to the old continent is an unknown and it is not clear if he could be affected by local regulations, as has happened with Apple and Meta. Expansion is a matter of time. Anthropic states that Claude’s website will “soon” more countries already users of the free plan. The latter makes sense since the competition offers access to the search without going through a box. It is not known when it will expand, but what is undeniable is that competition in AI does not stop accelerating. Images | Anthropic In Xataka | The O1-PRO model of Chatgpt Pro is 140 times more expensive than the Chatgpt O3-mini. What is not clear is that it is much better

Claude 3.7 has returned to life an old 1997 program. The question is whether IA can translate old projects in Cobol or Fortran

A veteran programmer received a two -year -old granddaughter’s visit a few days. It was then that he recalled that in 1997 he scheduled an application in Visual Basic 4 an application that might have fun. The problem was that he only had the executable, and it was not easy to be able to use it in his current equipment, so he came up with an idea. Translate this old APP to Python. What he did, as said in Reddit (With a post -written post for this AI), it was to upload the executable as an entrance for the newly launched the Claude 3.7 model. He asked him for something simple: “Can you tell me how to run this file? I think I scheduled it with Visual Basic 4, it would be great to turn it to Python.” Inverse pseudoengineering. The surprising thing is that Claude 3.7 showed some warnings, but then began to analyze the binary file and identified some program components. In fact it went further and translated that code to Python using the Pygame bookstore. Modernized code. The AI ​​system managed to perfectly replicate the functionality of the original program, and the model also provided instructions to install and execute it without problem, but it also was executed and operated 100% from the first moment. And all in five minutes. And modifications, the ones you want. This user would then ask for some improvements, such as associating certain sounds to the space bar or adding color typefaces, and Claude 3.7 modified the code perfectly to offer those improvements. But. The user shared All conversation in Claude so that anyone could check the process that followed. As Visual Basic said P-Code (Pseudocode), a kind of intermediate compilation in which there are some recognizable elements, but not too many to help that reverse engineering task. A door to modernize old software projects. It is true that the pseudocode generated by VB facilitated that “reverse engineering” task, but still this experiment with Claude 3.7 seems to open the door for many other software projects to modernize. A way out for mastodons created with Cobol or Fortran? In fact, already very veteran programming languages ​​such as Cobol or Fortan are still very important in industries such as banking, and force to maintain systems that can maintain compatibility with these old applications. Lack of programmers. There are no longer many programmers who dedicate themselves to these programming languages, which makes them “translate” them to more modern programming languages ​​is especially interesting. In 2023 IBM showed precisely A IA -based project to translate Cobol programsbut it is not clear if that has allowed to complete ambitious projects in that sense. What Claude 3.7 of course opens the door to achieve it. THE EXAMPLE OF MOCAS. We have already talked about Moccas, software created in 1958 and used in the US administration for the “mechanization of administration service contracts.” It is scheduled in Cobol-in fact it was originally programmed in Flow-Matic-and continues to work today, but it has been a long time since in the US They look for ways to adapt it to the new times. Perhaps Claude 3.7 or similar AI models manage to offer a solution to that problem. Even so, difficult. The example shown by that veteran programmer is striking, but it was a small application. Projects such as Moccas are enormously more complex and are part of a series of interdependencies with other components, which makes this possible task of reverse engineering and translation more difficult. But be careful: it may not be impossible, and at least Claude 3.7 and other models can end up being precisely the tool we needed to carry out that task. Image | Flipflopflorida In Xataka | Young programmers no longer know how

There are too many AI models. That raises a true death sentence for Anthropic and Claude

We have AI models to bore. And the problem is that everyone starts looking too close and deciding which one is better not simple. All companies and startups strive to be referents in an absolutely unleashed market. One that as in other technological wars probably ends some winners and enough losers. And there are those who compete with clear disadvantages. Another colossal investment round. In The Wall Street Journal indicate That Anthropic is about to close a new financing round that would allow him to lift 3.5 billion dollars. That would make the company’s assessment amount to 61.5 billion dollars, and the question is whether the company really has options in such a competitive market. “This is not a real company”. According to analyst Ed Zitron, Claude has Two million active monthly users in January 2025. It also talks about how according to the WSJ projected revenues for 2025 (based on current contracts) is 1.2 billion dollars, a very modest figure. “They also lost 5.6 billion dollars last year,” Sign it. According to his opinion, Anthropic “is not a real company, they could not survive without the beneficence of risk capital.” Fierce competition. The truth is that Anthropic is facing exceptional competition in which the large heavyweights of the Tech industry are both in the US and in China. Deepseek surprised all of them with the launch of Deepseek V3 and after Deepseek R1, and that seems to have encouraged investors to bet even more money through all these companies. OpenAI is still a reference. At least, it is in number of users. According to CNBC They already have 400 million of active users every week, an exceptional figure that clearly puts them at the head of the popularity ranking in this segment. As with Claude, Openai is burning money that he does not have and that they obtain from extraordinary financing rounds, but unlike this, we insist, the popularity of Chatgpt is evident. And the big ones have what matters now: money. For many users IA is chatgpt, and giants such as Google with Gemini, Microsoft with Copilot or Meta with flame are still far from achieving that acceptance. They have something that Anthropic (or perplexity) does not have: many, many funds – Grok 3, from Xai is another example – and can be maintained in this race even if that is costing them a lot of money. The prize is too fat not to chase him. There are too many models, some can stay on the road. In all technological wars there have been winners and losers. It is the same as what this battle for AI points, in which there are too many competitors and that it probably ends up causing some of these efforts to not survive. Here Anthropic is one of those at a disadvantage. The AI ​​winner can be a company still unknown. Openai, Google, Apple or Microsoft may be especially well positioned to win that race, but it does not have to be so. As they recently indicated In axiosnew company can arise, still unknown, that end up doing something differential and what none of the greats had thought. It is not easy, but of course it is not impossible. Remembering Netscape. In the second half of the 9th Internet began to show their potential, but the great A small company called Netscape He managed to become a reference in the world of browsers. Then it would end up being the great loser of that war, but it was the demonstration that having more money and resources does not always have to have all the options. And that’s why so much investment in startups. That possibility that the one that wins the race will be an unknown company is precisely the one that makes risk capital companies investing a lot of money in projects that may not get absolutely at all. It has recently occurred with Thinking Machines Labthe Startup of Mira Murati, or with Safe Superintelligencethat of Ilya Sutskever. None of them have a product to show, but still have already received spectacular investments. And be careful, there is also China. Of course there are formidable rivals that are not in the US. Mistral is a reference in Europe, while In China another particular war is being fought which has made today the models of the AI ​​of Chinese companies are so good (or sometimes, better) than those of the US. The winner of this battle could also come from that country. Or any other, of course. Image | Saradash Pradhan In Xataka | China has an ambitious plan to overcome the West in Technology. And he has already chosen his 18 companies to get it

Anthropic launches Claude 3.7 Sonnet, a “hybrid” model that is better than ever. Not only that: also “reason”

Anthropic has announced The launch and availability of Claude 3.7 Sonnet, its new model of founding. The jump is promising, but stands out especially for one thing: they point to reasoning models. It is not Claude 4.0, it is Claude 3.7. The number of the new version confirms once again that the jump of benefits does not justify a more “round” number. Many expected Claude 4.0, but in Anthropic they make it clear that this is a much more evolutionary version than revolutionary. A hybrid model. In Anthropic they presume from having a hybrid model that does not differentiate between whether to talk and answer questions quickly, reason or any other application, because everything is based on the Claude 3.7 founding model, which does everything and behaves in that way Multidisciplinary. And as it does everything, it is somewhat more expensive than the competition: its API costs $ 3 per million input tokens and $ 15 per million departure tokens Claude can already “reason”. In a separate announcement Anthropic told us about his new mode of reasoning, called “Extended Thinking Mode”, which now becomes a more option among which we can display when using its model. If we activate it, the model “will think more deeply about complex questions.” As those responsible explain, this mode uses the same AI model, but does so by giving it more time and investing more effort to reach an answer. How Claude thinks. This mode of reasoning offers the possibility of seeing what the model is thinking when processing those answers. Here they warn that this information can be surprising, because we can see how AI can “think” incorrect things, but also show that process does not mean that the answer is only based on it. “Our results suggest that models often make decisions based on factors that are not explicitly discussed in their reasoning process.” Things are saved. That is: the model seems to keep things for yourself while thinking, but it is not clear which or why. There is another reason not to show everything: that raises security problems, since having all that information potentially gives resources to bad actors to take advantage of the model of inappropriate forms. Source: Anthropic You can play Pokémon alone. The new Anthropic model is also more “agéntico” than ever. It responds better to changes in the environment and continues to act until an open task has been completed. That makes The “Computer Use” function which allows AI to control our computer to be increasingly promising. They demonstrated it with Pokémon: Claude 3.7 came much further than previous models. Claude Code arrives. The Anthropic model has always highlighted in the scope of programming, and now they wanted to promote that capacity with Claude Code, a BASDA tool in Claude 3.7 Sonnet but specifically focused on helping programmers to develop their projects. A programming agent. This could also be considered as Anthropic’s first agent, because Claude Code is able to complete programming projects autonomously without needing user interaction. Thus, Claude can search between basis with code on which to base, read and edit files, write and execute tests, publish the code in Github repositories and execute commands on a console while informing developers of the entire process. He Anthropic demonstrative video It allows you to check some of those functions. Similar to Grok3 in performance. The new Grok 3 presented these days by XAI showed one more step in its performance in the most demanding benchmarks today, and Claude 3.7 is also in that line, which means that It is something superior In those tests to models such as O1 and O3-mini (from OpenAI) and Deepseek R1. In Xataka | I have tried Deepseek on the web and in my Mac. Chatgpt, Claude and Gemini have a problem

Log In

Forgot password?

Forgot password?

Enter your account data and we will send you a link to reset your password.

Your password reset link appears to be invalid or expired.

Log in

Privacy Policy

Add to Collection

No Collections

Here you'll find all collections you've created before.