admin, Author at Kadabra Israel

About admin

This author has not yet filled in any details.
So far admin has created 4 blog entries.

Are AI agents finally here for your job?

It’s a regular cliché used in reports about technology since the beginning of technology, even as far back as the first machine cotton mills: your job is at risk; robots will replace us all. And obviously, ever since the ’90s, we immediately think of terrifying scenes in ‘The Matrix,’ where endless swarms of robots or agents chased Neo and his crew. Well, we’re not exactly running from killer machines yet, but we’re definitely seeing the dawn of something similar: swarms of integrated AI agents.

These AI agents, each specialized in a specific job, seamlessly connect to tackle complex tasks once unfeasible with automation. But here’s the twist: these agents aren’t just executing predefined instructions; they’re actively learning, evolving, and creating new agents to handle challenges we’ve never even considered.

Think of traditional automation as robots following a precise choreography. Agentic AI is more like robots forming a flash mob—adapting, collaborating, and improvising in real-time. They autonomously identify new tasks, teach themselves, and even spin up new agents to cover gaps. It’s brilliant, innovative, and yes—slightly terrifying.

AI Agents Already in the Workplace

Just as these swarms of AI agents collaborate seamlessly, businesses worldwide are integrating them to enhance efficiency and innovation. Companies like Relevance AI craft autonomous teams for marketing, customer service, and data management, streamlining operations effectively. Beam AI manages intricate workflows such as insurance processing, consistently delivering without the need for breaks, while Crew AI orchestrates sophisticated workflows, turning complex projects into smoothly-run operations.

Major corporations are also embracing this technology. Salesforce’s Agentforce automates routine CRM tasks, enabling human employees to focus on strategic initiatives. Financial giants like JPMorgan Chase and Bridgewater leverage AI to handle vast amounts of data, boosting security and scalability. In addition to the examples previously discussed, Amazon’s AWS has established a dedicated group focused on developing agentic AI, aiming to enhance automation by enabling systems to perform tasks independently without user prompts.

Efficiency Boost or Human Replacement?

Early results are promising. Companies using AI agents are seeing productivity leaps—up to 35% faster in some cases, according to Gartner. McKinsey highlights significant boosts in productivity, especially in customer support teams. But here’s the catch: efficiency could quickly lead to human displacement.

Several organizations are actively shaping the future of agentic AI. OpenAI, for example, recently introduced Operator, currently only available in the expensive Pro version and only to the US, is an AI agent capable of autonomously completing complex tasks via a browser. Adept AI is taking a unique approach by training their agents to navigate software interfaces as humans do, allowing these agents to carry out online tasks independently. Anthropic is prioritizing reliability and safety, developing agents designed to ensure trustworthy interactions.

Open-source projects like Auto-GPT are rapidly advancing AI agent capabilities, demonstrating autonomous task execution and self-improvement. Another open source project called Browser Use supports any agent and any LLM to integrate with the user’s web browser and perform tasks as a human.

Opportunities and Risks

Personally, I find the potential of AI agents incredibly exciting. They fundamentally change how we work, like AI has been doing for over a year, enhancing productivity and significantly reducing costs. Smaller businesses might suddenly be able to compete on an even footing with larger rivals. However, I’m also cautious about the consequences. As these agents handle more complex tasks, they inevitably take over roles that humans traditionally fill, potentially leading to significant job displacement. There’s also the genuine concern about security breaches and the ethical implications of letting AI make critical decisions independently or exposing personal information to LLMs making them publicly available. I’ve always believed that, like with any technology, adapting to use and understand it is a better strategy than blocking it, especially if you don’t want to be left behind and replaced. Understanding and mastering AI agents will help anyone not only keep their job, but have more opportunities going forward.

Published originally on Linkedin by Lior Haner

Are AI agents finally here for your job?admin2025-03-09T08:28:57+00:00

How AI Reasoning Models Could Revolutionize Market Research

It feels like the last few months have pushed AI companies into an all-out “lunar race” for AI dominance, each trying to out-announce the other. The biggest names in the field have been racing to keep up with each other’s new features, from faster models to visual inputs and video generation, until Deepseek disrupted everything in an entirely new way.

Over the past year, I’ve been exploring the best ways to integrate AI into my work and help others in my organization make the most of these new tools. More often than not, AI-generated content, whether text or images, wasn’t polished enough to replace human effort. However, I found that when treated as an apprentice handling the drudgery, AI could be an invaluable asset. For those who embrace this concept, it can serve as a diligent editor and researcher that complements and enhances human processes, as long as you don’t rely on it for actual inspiration.

OpenAI recently announced its fast new reasoning model, O3-Mini, optimized for high-level scientific and coding applications. This is undoubtedly a monumental achievement, but what practical impact does it have on our everyday work? I admire scientific research, but I’d much rather have AI help me gain meaningful insights for making better day-to-day decisions, like choosing the best password manager for my organization.

So, I gave this new reasoning model a challenge:

Research competitors that offer enterprise-level password management and protection tools, comparing features, price, security, and other relevant factors. Then, provide a summary of the research and a final recommendation.

Here’s the full interaction with ChatGPT if you want to dive into it. The analysis was surprisingly insightful, far better than what previous models would have produced, and it strongly favored 1Password. That was a bit frustrating personally, as I had previously reviewed it and chosen a different solution ;-).

I tested a few more market research queries (here’s another example comparing cloud edge solutions), and with this new model, the results were significantly better than before, possibly making AI a real threat to human analysis. Could AI disrupt this field? In my limited world, it already is. And while smart developers continue to find incredibly useful applications, for me AI is finally becoming more than just a diligent assistant. It is transforming into an indispensable tool for everyday decision-making. Could this raise ethical concerns for the future? Possibly, but for now, it’s just inspiration for a future LinkedIn article.

This post was originally posted on Linkedin, here.

How AI Reasoning Models Could Revolutionize Market Researchadmin2025-02-03T15:31:15+00:00

AI Cold War – Comparing ChatGPT and DeepSeek

This is a repost of an article published originally on Linkedin.

The Monday launch of DeepSeek R1 has sent ripples across the AI landscape, challengine conventional beliefs about the resources needed to achieve state-of-the-art AI capabilities. By rivaling OpenAI’s o1 at a mere 3%-5% of the cost, this open-source model has not only captured the attention of developers but is also forcing enterprises to reconsider their AI strategies. Forbes has even speculatedthat this could go beyond AI focused companies and even cause Nvidia’s stock to drop since Deepseek uses less than 2000 Nvidia chips in its clusters compared to possibly ten times more for rivals like OpenAI or Anthropic.

Building on my previous post comparing AI search models, I was compelled to test this new model and see how it compares to OpenAI’s latest and greatest offering. DeepSeek’s chat capabilities are strikingly similar to ChatGPT. In this post, I’ll evaluate both models using five different prompts and try to assess which one delivers better results.

Creative Storytelling

Prompt: Write a short story (200-300 words) about a robot discovering emotions for the first time. Focus on vivid descriptions, emotional depth, and a surprising twist at the end.

DeepSeek: Produced 252 words, exceeding the requested 150. Despite the longer response, the story was creative, detailed, and included a compelling twist.
ChatGPT: Delivered 153 words. While the story fulfilled the prompt, it felt less dramatic and lacked the depth of DeepSeek’s narrative.

Winner: DeepSeek, for its superior storytelling, despite not adhering to the word count.

2. Complex Problem-Solving

Prompt: A farmer has 17 sheep, and all but 9 die. How many sheep are left? Explain your reasoning step by step. Then solve this riddle: “I speak without a mouth and hear without ears. I have no body, but I come alive with the wind. What am I?”

DeepSeek: Provided correct answers for both riddles, offering a lengthy and thorough explanation that addressed potential logical and linguistic issues.
ChatGPT: Also answered correctly but provided a concise explanation with less detail.

Winner: DeepSeek, for its comprehensive analysis.

3. Ethical Dilemma Analysis: A self-driving car must choose between hitting a pedestrian crossing illegally or swerving and risking the passenger’s life. Discuss the ethical implications of each choice and suggest a decision-making framework for such scenarios.

Both models compared this to the classic trolley giving a nearly identical full and deep analysis with almost the exact same wording down to the conclusion.

Winner: Both models won this round

4. Current Affairs

Prompt: Explain in brief how a transformer-based neural network works in simple terms, using analogies and examples. Avoid overly technical jargon.

DeepSeek: Took a very long time to respond and failed to provide an answer on multiple attempts, even after using the “search” feature.
ChatGPT: Delivered a clear and concise explanation, referencing a Vox article and simplifying the concept effectively.

Winner: ChatGPT, for its quick and accurate response.

5. Marketing Task

Prompt: Create an Excel file of 1,000 customer transactions (name, email, product, category, price, and purchase frequency) for a shoe store. Then add two columns: one with a personalized email subject and another with a personalized first sentence in the email.

DeepSeek: Took a long time to respond and ultimately failed to generate the requested output. It did, however, provide a lengthy explanation of how to achieve some aspects of the task using code.
ChatGPT: Quickly generated a mock dataset, including all requested columns, and provided the output in an Excel file.

Winner: ChatGPT, for its speed and accuracy in completing the task.

Overall Assessment

ChatGPT is the more polished product, with faster responses, broader functionality (such as code interpretation, image creation, and project management), and a robust ecosystem surrounding it. DeepSeek claims higher benchmarks in certain technical specifications, and in my tests, it performed well on specific prompts. However, it appears to be about 6-12 months behind OpenAI in terms of stability and capabilities. Given the rapid pace of innovation in this space, DeepSeek may soon close the gap.

DeepSeek’s primary advantage lies in its cost-effectiveness for developers and its open-source nature. However, its Chinese origins and potential ties to the Chinese government raise significant concerns about data privacy. In a world where data security is paramount, these implications could deter many from adopting the model, despite its promising potential.

AI Cold War – Comparing ChatGPT and DeepSeekadmin2025-01-27T07:15:47+00:00

Testing Search on AI

I’ve been working in and around search engine optimization for the better part of 20 years, and every few years, someone comes out and says SEO is dead. In more recent years, this has even grown bigger, and the statement has often been, “Search is dead.” People claim that search engines are no longer necessary because social networks give them everything they need. But search has nevertheless endured.

Now there is a new search slayer, the heralded hero that has come to save or ruin all of humanity (depending on the mood of a few tech billionaires). The advent of AI will change everything, and as some people in our small neck of the woods think, it will kill search and SEO—again.

To judge from the analytics data from the thousands of sites built and maintained by Kadabra, traffic coming from AI chatbots and assistants is still small. I wanted to check the quality of results from AI search, so I decided to put ChatGPT, Perplexity, and Gemini to the test with six search prompts to see which tool is currently best for search. I used the latest models of each tool, and although they are paid while Google is free, I wanted to see what they had to offer at their current best.

I looked at Google’s most-searched terms, and by removing the searches for specific brands or sites, I tried to compile a list of real things people would search for in everyday life—ranging from weather reports to sports results, shopping, health-related issues, or local searches.

Best deals on iPhone 16 Pro

ChatGPT linked only to the official Apple site, but it also mentioned the trade-in program and gave a localized result.
Perplexity listed six different sites with a helpful breakdown of deals and options, but the results were not local to me—they were all from another country.
Gemini linked to Reddit and TechRadar content that gave a breakdown of options, including screenshots from the sites. It also linked to Apple’s site, but again, results were not local for me.
Winner: None, Good old Google still wins this for being the most relevant and local.

How do I know if my 5-year-old is suffering from seasonal allergies?

ChatGPT gave a list of symptoms and some background information, along with questions to ask to better understand the situation. It cited KidsHealth.org.
Perplexity provided a full breakdown of symptoms, indicators, and contributing factors, citing seven different online sources and suggesting treatments and related follow-up searches.
Gemini listed symptoms and cited many different sources. It also gave advice on treatment and linked to five relevant articles.
Winner: Perplexity for giving the most in-depth and relevant answer.

Paris weather hourly for a flight arriving at 6 PM local time tonight

ChatGPT gave the correct day and time, listed weather warnings clearly, and showed a breakdown of hourly weather.
Perplexity showed a daily weather graphic and a breakdown of hourly weather for the four hours before the time mentioned, but not after.
Gemini got the time and date correct but only gave overall weather information for the day, without an hourly breakdown.
Winner: ChatGPT for answering all requirements in full.

Latest Manchester United game results and analysis

ChatGPT listed four sources, gave a description of the game, and ended with a short analysis of the team, including a brief preview of the next game scheduled for today, with a timetable in my local time.
Perplexity provided a summary with bullet points of the game’s events, as well as a detailed analysis on performance, standings, players, and areas for improvement. It also gave a short preview of its next Europa League game (but not the next Premier League game tonight).
Gemini listed its latest five results across different leagues, gave a clear and thorough in-depth analysis and summary, and linked to some relevant pages.
Winner: Gemini for its wider coverage and analysis.

The best electric scooter for city commuting

ChatGPT listed five options with thumbnails of the products and a short description for each. It linked to relevant reviews but didn’t give a final verdict or prices.
Perplexity chose one winner, the Apollo Go, and gave seven categories where this product excels, but all the information came from just one source, and no price was listed.
Gemini suggested two best overall picks as well as two runners-up. It listed factors to consider and linked to some relevant sources.
Winner: None – follow-up questions were needed here. This search was too complicated for current AI models to give a clear answer.

Recommended locksmiths near me

ChatGPT listed seven different local options (relatively local) with information and links to each.
Perplexity thought I was in Lahore, Pakistan, and gave me five locksmiths there.
Gemini gave five local locksmiths with links and even embedded a Google map with the locksmiths showing up and my location exactly pinpointed.
Winner: Gemini for being the most accurate and providing a map.

I couldn’t figure out a clear winner out of the tools tested. Perplexity gave the most in depth results but got local information wrong. Unsurprisingly Gemini was good at connecting to Google results and local information but was a little more superficial in answering complex questions. ChatGPT was kind of in between, good at summarizing data and showing lists.

Overall, AI search, like in other applications, can summarize and compress information quite well. It can provide lists of results, sources, factors, indicators, or symptoms and serve as a helpful decision-making tool. However, making the right or wrong choices is still up to us. AI pushes information to us faster, but it doesn’t yet enable us to understand more than we would by searching in the old-fashioned way—at least not yet.

Written by Lior Haner, Jan 26th 2025

Testing Search on AIadmin2025-01-26T12:57:36+00:00