Siri
Siri is a virtual assistant that is part of Apple Inc.'s iOS, iPadOS, watchOS, macOS, and tvOS operating systems. The assistant uses voice queries and a natural-language user interface to answer questions, make recommendations, and perform actions by delegating requests to a set of Internet services. The software adapts to users' individual language usages, searches, and preferences, with continuing use. Returned results are individualized.
Siri is a spin-off from a project originally developed by the SRI International Artificial Intelligence Center. Its speech recognition engine was provided by Nuance Communications, and Siri uses advanced machine learning technologies to function. Its original American, British, and Australian voice actors recorded their respective voices around 2005, unaware of the recordings' eventual usage in Siri. The voice assistant was released as an app for iOS in February 2010, and it was acquired by Apple two months later. Siri was then integrated into iPhone 4S at its release in October 2011. At that time, the separate app was also removed from the iOS App Store. Siri has since become an integral part of Apple's products, having been adapted into other hardware devices over the years, including newer iPhone models, as well as iPad, iPod Touch, Mac, AirPods, Apple TV, and HomePod.
Siri supports a wide range of user commands, including performing phone actions, checking basic information, scheduling events and reminders, handling device settings, searching the Internet, navigating areas, finding information on entertainment, and is able to engage with iOS-integrated apps. With the release of iOS 10 in 2016, Apple opened up limited third-party access to Siri, including third-party messaging apps, as well as payments, ride-sharing, and Internet calling apps. With the release of iOS 11, Apple updated Siri's voices for more clear, human voices, started supporting follow-up questions and language translation, and additional third-party actions.
Siri's original release on iPhone 4S in 2011 received mixed reviews. It received praise for its voice recognition and contextual knowledge of user information, including calendar appointments, but was criticized for requiring stiff user commands and having a lack of flexibility. It was also criticized for lacking information on certain nearby places, and for its inability to understand certain English accents. In 2016 and 2017, a number of media reports said that Siri lacked innovation, particularly against new competing voice assistants. The reports concerned Siri's limited set of features, "bad" voice recognition, and undeveloped service integrations as causing trouble for Apple in the field of artificial intelligence and cloud-based services; the basis for the complaints reportedly due to stifled development, as caused by Apple's prioritization of user privacy and executive power struggles within the company.
Development
Siri is a spin-out from the SRI International Artificial Intelligence Center, and is an offshoot of the US Defense Advanced Research Projects Agency's -funded CALO project. It was co-founded by Dag Kittlaus, Tom Gruber, and UCLA alumnus Adam Cheyer. Kittlaus named Siri after a co-worker in Norway, and it means "beautiful woman who leads you to victory" in Norwegian.Siri's speech recognition engine was provided by Nuance Communications, a speech technology company. This was acknowledged by neither Apple nor Nuance for years, until Nuance CEO Paul Ricci confirmed the information at a 2011 technology conference. The speech recognition system makes use of sophisticated machine learning techniques, including convolutional neural networks and long short-term memory.
The initial Siri prototype was implemented using the Active platform, a joint project between the Artificial Intelligence Center of SRI International and the Vrai Group at Ecole Polytechnique Fédérale de Lausanne. The Active platform was the focus of a Ph.D. thesis led by Didier Guzzoni, who joined Siri as its chief scientist.
Siri was acquired by Apple Inc. in April 2010 under the direction of Steve Jobs. Apple's first notion of a digital personal assistant was originally a concept video in 1987, called the Knowledge Navigator.
Voices
The original American voice of Siri was provided in July 2005 by Susan Bennett, unaware that it would eventually be used for the voice assistant. A report from The Verge in September 2013 about voice actors, their work, and machine learning developments, made hints that Allison Dufty was the voice behind Siri, though this was disproven when Dufty wrote on her website that she was "absolutely, positively not the voice of Siri." Citing growing pressure, Bennett revealed her role as Siri in October, and her claim was proven by Ed Primeau, an American audio forensics expert. Apple has never confirmed the information.The original British male voice was provided by Jon Briggs, a former technology journalist. Having discovered that he was the voice of Siri by watching television, he first spoke only about his role in November 2011, also acknowledging his voice work was done "five or six years ago" without knowing the recordings' final usage form.
The original Australian voice was provided by Karen Jacobsen, a voice-over artist known in Australia for her work as the GPS girl.
As part of an interview between all three voice actors and The Guardian, Briggs stated that "the original system was recorded for a US company called Scansoft, who were then bought by Nuance. Apple simply licensed it."
With iOS 11, Apple auditioned hundreds of candidates to find a new female voices, then recorded hours of speech, including different personalities and expressions, and built a new text-to-speech voice based on deep learning technology.
Integration
Siri was originally released as a stand-alone application for the iOS operating system in February 2010, and at the time, the developers were also intending to release Siri for Android and BlackBerry devices. Two months later, Apple acquired Siri. On October 4, 2011, Apple introduced the iPhone 4S with a beta version of Siri. After the announcement, Apple removed the existing standalone Siri app from App Store. TechCrunch wrote that, despite the Siri app's support for iPhone 4, its removal from App Store might also have had a financial aspect for the company, in providing an incentive for customers to upgrade devices. Third-party developer Steven Troughton-Smith, along with 13-year-old Zak Shafi, however, managed to port Siri to iPhone 4, though without being able to communicate with Apple's servers. A few days later, Shafi and Troughton-Smith, working with an anonymous person nicknamed "Chpwn", managed to fully hack Siri, enabling its full functionalities on iPhone 4 and iPod Touch devices. Shafi split from the group, founding SimplySiri with other developers to provide public access to the new founded exploit. Additionally, developers were also able to successfully create and distribute legal ports of Siri to any device capable of running iOS 5, though a proxy server was required for Apple server interaction.Over the years, Apple has expanded the line of officially supported products, including newer iPhone models, as well as iPad support in June 2012, iPod Touch support in September 2012, Apple TV support, and the stand-alone Siri Remote, in September 2015, Mac and AirPods support in September 2016, and HomePod support in February 2018.
Venture capital firm Mangrove Capital Partners predicted that Apple will launch a SiriOS at its developer conference in 2020 to further grow the Siri ecosystem. SiriOS could rival something like Amazon's Alexa Skills platform, which makes it easy for developers to implement Alexa functionality. Apple currently offers SiriKit to developers, but one possibility is that a SiriOS could work across iOS, iPadOS, and macOS with ease.
Features and options
Apple offers a wide range of voice commands to interact with Siri, including, but not limited to:- Phone and Text actions, such as "Call Sarah", "Read my new messages", "Set the timer for 10 minutes", and "Send email to mom"
- Check basic information, including "What's the weather like today?" and "How many dollars are in a Euro?"
- Schedule events and reminders, including "Schedule a meeting" and "Remind me to"
- Handle device settings, such as "Take a picture", "Turn off Wi-Fi", and "Increase the brightness"
- Search the Internet, including "Define...", "Find pictures of...", and "Search Twitter for..."
- Navigation, including "Take me home", "What's traffic like on the way home?", and "Find driving directions to..."
- Translate words and phrases from English to a few languages, such as "How do I say where is the nearest hotel in French"
- Entertainment, such as "What basketball games are on today?", "What are some movies playing near me?", and "What's the synopsis of...?"
- Engage with iOS-integrated apps, including "Pause Apple Music" and "Like this song"
- Handle payments through Apple Pay, such as "Apple Pay 25 dollars to Mike for concert tickets" or "Send 41 dollars to Ivana."
Initially limited to female voices, Apple announced in June 2013 that Siri would feature a gender option, adding a male voice counterpart.
In September 2014, Apple added the ability for users to speak "Hey Siri" to enable the assistant without the requirement of physically handling the device.
In September 2015, the "Hey Siri" feature was updated to include individualized voice recognition, a presumed effort to prevent non-owner activation.
With the announcement of iOS 10 in June 2016, Apple opened up limited third-party developer access to Siri through a dedicated application programming interface. The API restricts usage of Siri to engaging with third-party messaging apps, payment apps, ride-sharing apps, and Internet calling apps.
In iOS 11, Siri is able to handle follow-up questions, supports language translation, and opens up to more third-party actions, including task management. Additionally, users are able to type to Siri, and a new, privacy-minded "on-device learning" technique improves Siri's suggestions by privately analyzing personal usage of different iOS applications.
Reception
Siri received mixed reviews during its beta release as an integrated part of iPhone 4S in October 2011.MG Siegler of TechCrunch wrote that Siri was "great," praising the potential for Siri after losing the beta tag:
"The amount of times Siri hasn't been able to understand and execute my request is astonishingly low.... Just imagine what will happen when Apple partners with other services to expand Siri further. And imagine when they have an API that any developer can use. This really could alter the mobile landscape."
Writing for The New York Times, David Pogue also praised Siri's language understanding and ability to understand context:
" thinks for a few seconds, displays a beautifully formatted response and speaks in a calm female voice.... It's mind-blowing how inexact your utterances can be. Siri understands everything from, 'What's the weather going to be like in Tucson this weekend?' to 'Will I need an umbrella tonight?'... Once, I tried saying, 'Make an appointment with Patrick for Thursday at 3.' Siri responded, 'Note that you already have an all-day appointment about "Boston Trip" for this Thursday. Shall I schedule this anyway?' Unbelievable."
Jacqui Cheng of Ars Technica wrote that Apple's claims of what Siri could do were bold, and the early demos "even bolder":
"Though Siri shows real potential, these kinds of high expectations are bound to be disappointed.... Apple makes clear that the product is still in beta—an appropriate label, in our opinion."While praising its ability to "decipher our casual language" and deliver "very specific and accurate result," sometimes even providing additional information, Cheng noted and criticized its restrictions, particularly when the language moved away from "stiffer commands" into more human interactions. One example included the phrase "Send a text to Jason, Clint, Sam, and Lee saying we're having dinner at Silver Cloud," which Siri interpreted as sending a message to Jason only, containing the text "Clint Sam and Lee saying we're having dinner at Silver Cloud." She also noted a lack of proper editability, as saying "Edit message to say: We're at Silver Cloud and you should come find us," generated "Clint Sam and Lee saying we're having dinner at Silver Cloud to say we're at Silver Cloud and you should come find us."
Google's executive chairman and former chief, Eric Schmidt, conceded that Siri could pose a competitive threat to the company's core search business.
Siri was criticized by pro-abortion rights organizations, including the American Civil Liberties Union and NARAL Pro-Choice America, after users found that Siri could not provide information about the location of birth control or abortion providers nearby, sometimes directing users to crisis pregnancy centers instead.
Natalie Kerris, a spokeswoman for Apple, told The New York Times:
"Our customers want to use Siri to find out all types of information, and while it can find a lot, it doesn't always find what you want.... These are not intentional omissions meant to offend anyone. It simply means that as we bring Siri from beta to a final product, we find places where we can do better, and we will in the coming weeks."
In January 2016, Fast Company reported that, in then-recent months, Siri had begun to confuse the word "abortion" with "adoption", citing "health experts" who stated that the situation had "gotten worse." However, at the time of Fast Companys report, the situation had changed slightly, with Siri offering "a more comprehensive list of Planned Parenthood facilities", although "Adoption clinics continue to pop up, but near the bottom of the list."
Siri has also not been well received by some English speakers with distinctive accents, including Scottish and Americans from Boston or the South.
In March 2012, Frank M. Fazio filed a class action lawsuit against Apple on behalf of the people who bought iPhone 4S and felt misled about the capabilities of Siri, alleging its failing to function as depicted in Apple's Siri commercials. Fazio filed the lawsuit in California and claimed that the iPhone 4S was merely a "more expensive iPhone 4" if Siri fails to function as advertised. On July 22, 2013, U.S. District Judge Claudia Wilken in San Francisco dismissed the suit but said the plaintiffs could amend at a later time. The reason given for dismissal was that plaintiffs did not sufficiently document enough misrepresentations by Apple for the trial to proceed.
Perceived lack of innovation
In June 2016, The Verges Sean O'Kane wrote about the then-upcoming major iOS 10 updates, with a headline stating "Siri's big upgrades won't matter if it can't understand its users":"What Apple didn't talk about was solving Siri's biggest, most basic flaws: it's still not very good at voice recognition, and when it gets it right, the results are often clunky. And these problems look even worse when you consider that Apple now has full-fledged competitors in this space: Amazon's Alexa, Microsoft's Cortana, and Google's Assistant."Also writing for The Verge, Walt Mossberg had previously questioned Apple's efforts in cloud-based services, writing:
"...perhaps the biggest disappointment among Apple's cloud-based services is the one it needs most today, right now: Siri. Before Apple bought it, Siri was on the road to being a robust digital assistant that could do many things, and integrate with many services—even though it was being built by a startup with limited funds and people. After Apple bought Siri, the giant company seemed to treat it as a backwater, restricting it to doing only a few, slowly increasing a number of tasks, like telling you the weather, sports scores, movie and restaurant listings, and controlling the device's functions. Its unhappy founders have left Apple to build a new AI service called Viv. And, on too many occasions, Siri either gets things wrong, doesn't know the answer, or can't verbalize it. Instead, it shows you a web search result, even when you're not in a position to read it."
In October 2016, Bloomberg reported that Apple had plans to unify the teams behind its various cloud-based services, including a single campus and reorganized cloud computing resources aimed at improving the processing of Siri's queries, although another report from The Verge, in June 2017, once again called Siri's voice recognition "bad."
In June 2017, The Wall Street Journal published an extensive report on the lack of innovation with Siri following competitors' advancement in the field of voice assistants. Noting that Apple workers' anxiety levels "went up a notch" on the announcement of Amazon's Alexa, the Journal wrote: "Today, Apple is playing catch-up in a product category it invented, increasing worries about whether the technology giant has lost some of its innovation edge." The report gave the primary causes being Apple's prioritization of user privacy, including randomly-tagged six-month Siri searches, whereas Google and Amazon keep data until actively discarded by the user and executive power struggles within Apple. Apple did not comment on the report, while Eddy Cue said: "Apple often uses generic data rather than user data to train its systems and has the ability to improve Siri's performance for individual users with information kept on their iPhones."
Swearing
The iOS version of Siri ships with a vulgar content filter; however, it is disabled by default and must be enabled by the user manually. Therefore, if Siri can be triggered into cursing, broadcasting discriminatory content, and so on, the actions will most likely be carried out. Over its history, multiple methods and techniques have been used to trigger Siri into swearing. The language-filter is not perfect and can still be bypassed.In 2018, Ars Technica reported a new glitch that could be exploited by a user requesting the definition of "mother" be read out loud. Siri would issue a response and ask the user if they would like to hear the next definition; when the user replies with "yes," Siri would mention "mother" as being short for "motherfucker." This resulted in multiple YouTube videos featuring the responses and/or how to trigger them. Apple fixed the issue silently, although it is unconfirmed if the videos specifically brought it to their attention. The content is picked up from third-party sources such as the Oxford English Dictionary and not a supplied message from the corporation. It is unknown whether this glitch was exploited with the vulgar content filter enabled.