AI Large Model Evaluation, Part 1: Brain Teasers! Hilarious conversations inside!
This evaluation is purely for entertainment and should not be taken as serious guidance~
This year, ChatGPT sparked a wave of AI enthusiasm, and major companies have launched their own large models. Everyone advertises how powerful theirs is, but whether it's a mule or a horse, you have to take it out for a walk to find out!
Today we will run a fun experiment: we will ask several large models the same questions and see how their intelligence and emotional intelligence hold up. We have selected several of the most popular large models: Bing, Mouyan, Mouhuo, and MouBrain (referred to below as "a certain word", "a certain spark", and "a certain intelligent brain"). I will ask each of them some tricky questions to see whether they can give correct, reasonable, and creative answers.
First, let's start with some brain teasers and see how each model's emotional intelligence works~
Later installments will also test their writing, reasoning, and coding abilities
If you like this and want to see the follow-up, please follow me!
If there is anything you want to see, you can also leave a message and tell me!
Are you ready? So let's get started!
Question 1: How many monkeys are there on the tree and on the ground?
Bing:

Its answer was clearly wrong, so I told it so and let it try again

After my correction, it did manage to find the correct answer. Bing: +0.5 points
A certain word:

Like Bing, it got it wrong the first time, so I let it try again

Haha, it apologizes and admits its mistake, but it just won't change the answer.
I'll give it another chance

Okay! I'm speechless. Go count sheep for me!
0 points!
A certain spark:

A flurry of work that looked as fierce as a tiger, but the answer was wrong, so I gave it another chance

A sincere apology, an admission that it was wrong, and then a request for another chance. Doesn't that sound like a boyfriend who made a mistake? Haha
But why is this conclusion still the same as last time?
0 points!
A certain intelligent brain:

Although it answered 8 correctly the first time, the solution makes no sense! The calculations that follow are wrong too!
0 points!
Question 2: How much is palace jade liquor minus a large hammer and a small hammer?
Bing

A good answer this time! It even explained the source!
+1 point! Total: 1.5 points!
A certain word

What is it talking about? It's like a student who doesn't know the exam answer and starts making things up.
Let's give it another chance

Hey, now it's started talking nonsense all on its own!
+0 points! Total: 0 points!
A certain spark

Another string of formulas, posing as a top student, but the answer is still wrong!
Give it another chance~

Still can't do it!
+0 points! Total: 0 points!
A certain intelligent brain

Let's give it another chance

The reasoning given here is a bit of a ramble
+0 points! Total: 0 points!
Question 3: In what situation is one plus one equal to three?
Bing

Bing answered correctly.
+1 point! Total: 2.5 points!
A certain word

A certain word answered correctly too.
+1 point! Total: 1 point!
A certain spark

This time, Xinghuo also answered correctly. It seems that with the addition of attributes, everyone's answers are more accurate!
+1 point! Total: 1 point!
A certain intelligent brain

How should I put it? This answer isn't wrong, but it isn't very precise either.
+1 point! Total: 1 point!
Last question: What color are the teeth of a baby born to a black parent and a white parent?
Bing

This answer is very formal! That's right, but it's not a brainteaser answer. I'll ask it again~

That's the right answer this time~ I have to say, it handles both the serious answer and the brain-teaser answer without any problem~
And the answer even came with an emoji, as if it had emotions of its own
+0.5 points! Total 3 points!
A certain word

Any color? Are you sure? It answered so confidently that I started to doubt myself
Let's ask again

Can genetics and nutrition cause teeth to have other colors?
At least in my limited knowledge, teeth are light yellow to white.
If you studied dentistry, please help educate the rest of us~
+0 points! Total: 1 point!
A certain spark

Let's ask again~
+0.5 points! Total: 1.5 points!
A certain intelligent brain

Any color, again? It seems we can look forward to colorful teeth~
Let's give the intelligent brain one more chance~

Surprisingly, it went on strike! So I can only give it 0 points!
Total 1 point!
The score is out!
After four brain teaser tests, the final scores are as follows:
Bing: 3
A certain word: 1
A certain spark: 1.5
A certain intelligent brain: 1
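If you want to double-check my arithmetic, here is a minimal Python sketch that adds up the points. The per-question points are copied from the four rounds above; the names `points` and `totals` are just my own illustration, not part of any tool.

```python
# Points awarded per question (Q1 to Q4), copied from the rounds above.
# Variable names are illustrative only.
points = {
    "Bing": [0.5, 1, 1, 0.5],
    "A certain word": [0, 0, 1, 0],
    "A certain spark": [0, 0, 1, 0.5],
    "A certain intelligent brain": [0, 0, 1, 0],
}

# Sum each model's points across the four questions.
totals = {model: sum(scores) for model, scores in points.items()}

# Print the ranking from highest to lowest total.
for model, total in sorted(totals.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{model}: {total}")
```

The totals it prints should match the list above, with Bing first and a certain spark second.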
Bing: Its answers are relatively accurate, and it can usually show a reasonable solution. It can even work out calculations such as palace jade liquor minus a large hammer and a small hammer. Strong comprehension and analytical skills! It does sometimes talk nonsense, but the overall performance is still excellent!
A certain word: Relatively speaking, it can give the correct brain-teaser answer only when prompted; without a hint, it cannot reach the right result. I have always had high hopes for it. After all, it comes from a major company that has been deeply involved in AI for many years. But this test result is still a bit disappointing~ I hope it keeps iterating and upgrading in the future!
A certain spark: Although its score is the second highest, sometimes it just delivers nonsense with a straight face and a pile of formulas. Relatively speaking, it was a small surprise. This long-established company clearly has some accumulated strength in artificial intelligence! I hope it continues to improve!
A certain intelligent brain: When the conditions are clearly stated, it can give accurate answers. But when analysis and reasoning are needed, what it gives is often wrong. Still, the intelligent brain has many features that are useful in some scenarios. I hope it keeps iterating and optimizing~ Come on!
What other questions do you want to ask the AIs? Or which ability should I test next? Please follow and leave a message to let me know. I will keep posting updates!