Zephyrnet Logo

Tag: tricky

Metaverse Gambling Guide: Best Metaverse Casinos

Metaverse is more than a buzzword, it is an immersive 3D virtual world on the internet. But the Metaverse and its opportunities expand even...

When It Comes To Excess Inventory, Prevention Is Better Than Cure

Excess inventory – it’s taking up your warehouse space, tying up working capital, and limiting your planning team’s range of motion. It’s time to Marie...

Man Bets Entire Life Savings on Roulette

They say that those who don’t take risks won’t ever be great. Revell was 32 back then, and he was a London man who...

Have You Listened to our new Podcast yet?

The first Clarus podcast is out – please take a listen! Podcasting You may have noticed that we’ve had reason to celebrate a few milestones on the blog over the past year: 1,000 blogs on Clarus 400 blogs for both Chris and Amir With the blog continuing to be well received and well read (even […]

Choosing The Right Language Model For Your NLP Use Case

Large Language Models (LLMs) are Deep Learning models trained to produce text. With this impressive ability, LLMs have become the backbone of modern Natural Language Processing (NLP). Traditionally, they are pre-trained by academic institutions and big tech companies such as OpenAI, Microsoft and NVIDIA. Most of them are then made available for public use. This […]

The post Choosing The Right Language Model For Your NLP Use Case appeared first on TOPBOTS.

Dota 2 The International 2022 Highlights: Tundra Lifts Aegis, Underdogs Shine

Dota 2 The International 2022 Highlights: Tundra Lifts the Aegis Skip...

How to Identify And Minimise Rising Risks By Conducting DPIA?

Data privacy has become a buzzing topic on a worldwide level in the last couple of years. As cybercrime grows, so does the number...

Remarkable Tips for Listing and Selling Items on Multiple Platforms!

It’s no secret that listing and selling items on multiple platforms can help increase your reach and sales. However, knowing which platform best...

Crypto incubators have a responsibility to maintain fiscal discipline

Incubators provide a foundation for many crypto companies and have a responsibility to ensure they’re taking steps to survive in a bear market.

Ask a Techspert: How does Lens turn images to text?Ask a Techspert: How does Lens turn images to text?Keyword Contributor

When I was on holiday recently, I wanted to take notes from an ebook I was reading. But instead of taking audio notes or scribbling things down in a notebook, I used Lens to select a section of the book, copy it and paste it into a document. That got me curious: How did all that just happen on my phone? How does a camera recognize words in all their fonts and languages?

I decided to get to the root of the question and speak to Ana Manasovska, a Zurich-based software engineer who is one of the Googlers on the front line of converting an image into text.

Ana, tell us about your work in Lens.

I’m involved with the text aspect, so making sure that the app can discern text and copy it for a search or translate it — with no typing needed. For example, if you point your phone’s camera at a poster in a foreign language, the app can translate the text on it. And for people who are blind or have low vision, it can read the text out loud. It’s pretty impressive.

So part of what my team does is get Lens to recognize not just the text, but also the structure of the text. We humans automatically understand writing that is separated into sentences and paragraphs, or blocks and columns, and know what goes together. It’s very difficult for a machine to distinguish that, though.

Is this machine learning?

Yes. In other words, it uses systems (we call them models) that we’ve trained to discern characters and structure in images. A traditional computing system would have only a limited ability to do this. But our machine learning model has been built to “teach itself” on enormous datasets and is learning to distinguish text structures the same way a human would.

Can the system work with different languages?

Yes, it can recognize 30 scripts, including Cyrillic, Devanagari, Chinese and Arabic. It’s most accurate in Latin-alphabet languages at the moment, but even there, the many different types of fonts present challenges. Japanese and Chinese are tricky because they have lots of nuances in the characters. What seems like a small variation to the untrained eye can completely change the meaning.

What’s the most challenging part of your job?

There’s lots of complexity and ambiguity, which are challenging, so I’ve had to learn to navigate that. And it’s very fast paced; things are moving constantly and you have to ask a lot of questions and talk to a lot of people to get the answers you need.

When it comes to actual coding, what does that involve?

Mostly I use a programming language called C++, which enables you to run processing steps needed to take you from an image to a representation of words and structure.

Hmmm, I sort of understand. What does it look like?

A screenshot of some C++ code against a white background.

This is what C++ looks like.

The code above shows the processing for extracting only the German from a section of text. So say the image showed German, French and Italian — only the German would be extracted for translation. Does that make sense?

Kind of! Tell me what you love about your job.

It boils down to my lifelong love of solving problems. But I also really like that I’m building something I can use in my everyday life. I’m based in Zurich but don’t speak German well, so I use Lens for translation into English daily.

Decoding what the coders do: Ana works in Lens, focusing on text recognition. But what does that actually involve?

Why Online Gaming Is Fun

Introduction Technology has changed how things work worldwide, and since the introduction of the internet, people's lifestyles have changed. Most people rarely go out, especially...

Everyone can tackle water scarcity with Hydraloop

September 2022 By Catherine Jewell, Information and Digital Outreach Division, WIPO Hydraloop, a decentralized greywater recycling system, allows households to cut water consumption and wastewater emissions...

Latest Intelligence

spot_img
spot_img