My own cloud computing crash course.
Someone once told me: “If you still download your data, you’re not in 2020.”
For those who don’t know, this means that we are now in the age of cloud computing. As a data scientist, not knowing cloud computing is a sin (at least I have begun to think so).
什么是云计算？ (What is Cloud Computing?)
By Wikipedia, cloud computing is “the on-demand availability of computer system resources, especially data storage (cloud storage) and computing power, without direct active management by the user,” which means that you can store data elsewhere than your computer, still has access to it but with the security that it is not shared with everyone.
More practically, these are all the cloud services you probably have used already if you own a smartphone.
and also this
also Apple Cloud, just to name a few.
Essentially, if you can access your information and data from a different device with an online account, you are using the cloud service.
I still remember the day when my mom’s phone broke down, and we lost all the photos of my precious baby sister. That was before the cloud, and everything was stored on her iPhone. Now, if her phone breaks down, she can buy a new one and download all the data from her old phone from her cloud account. My company also stores everything on Google Drive with the enterprise account so that we can share and collaborate on documents online without having to send excel sheets over to one another as I used to at a different company.
我仍然记得那天妈妈的电话坏了，我们丢掉了我宝贵的小妹妹的所有照片。 那是在云端之前，所有东西都存储在她的iPhone中。 现在，如果手机出现故障，她可以购买新手机并从其云帐户下载旧手机中的所有数据。 我的公司还使用企业帐户将所有内容存储在Google云端硬盘中，这样我们就可以在线共享文档并在文档上进行协作，而不必像我以前在另一家公司那样互相发送excel表。
Cloud computing relieves you from waiting time before someone takes a look at your documents and then sending it back and forth. It also allows the computer to work faster on bigger datasets because it does not have to store and load this data on its own. There’s also that back up and security component that takes away the stress and financial burden when the hardware is broken or lost (building caught on fire? iPhone dropped down the drain?).
云计算使您不必等待别人查看您的文档然后来回发送的时间。 它还使计算机可以在更大的数据集上更快地工作，因为它不必自己存储和加载这些数据。 还有备份和安全组件，可以在硬件损坏或丢失(建筑物着火了？iPhone掉进水里了吗？)时减轻了压力和财务负担。
云服务的类型 (Types of cloud services)
When it comes to users and customers of cloud computing services, we have a beautiful image to help visualize the different tiers.
First, there is the SaaS, then the PaaS, and lastly, the IaaS.
SaaS is the host of end-user tools and services that I showed above, with Google Drive, MS Office, Apple iCloud, and everything of the kind.
SaaS是我上面显示的最终用户工具和服务的宿主，这些工具和服务包括Google Drive，MS Office，Apple iCloud以及所有类似类型的东西。
PaaS is typically for developers who design websites, develop apps, and manage databases. Data scientists can optimize the cloud to manage their business database as I do with Google Sheets. However, when it comes to database cloud services, Google Sheets is not the best for big datasets because it’s limited to only 1M rows, and it takes quite long to load if your database structure is complicated. More on that below.
PaaS通常适用于设计网站，开发应用程序和管理数据库的开发人员。 就像我使用Google表格一样，数据科学家可以优化云以管理其业务数据库。 但是，在数据库云服务方面，Google Sheets并不是大型数据集的最佳选择，因为它仅限于100万行，并且如果您的数据库结构复杂，则加载时间会很长。 下面的更多内容。
IaaS is a storage infrastructure, allowing companies to rent out the CPUs at data centers to store their data. Simply speaking, they rent out the computers, while PaaS lend out the time you can store data on their CPUs + some cool tools, and with SaaS you are just buying some time to store your data on their hard drive and not yours (like the iCloud storage you pay annually).
This image also helps to show what you manage and what your service provider manages for each service.
云服务提供商 (Cloud service providers)
Now we are getting to the service providers, our household names are Amazon Web Services, Microsoft Azure, and Google Cloud Platform. These three companies are the most popular and seemingly coming on top of the race as well.
现在我们要转到服务提供商，我们的姓氏是Amazon Web Services，Microsoft Azure和Google Cloud Platform。 这三家公司是最受欢迎的公司，似乎也位居榜首。
亚马逊网络服务 (Amazon Web Services)
AWS is the oldest cloud services since 2006 so it is no surprise that it is leading the way in the cloud computing sector. They have the highest number of data centers and has the most varied range of services. Notable clients of AWS are Netflix, LinkedIn, Facebook, Baidu, and ESPN.
AWS是自2006年以来最古老的云服务，因此它在云计算领域处于领先地位也就不足为奇了。 它们的数据中心数量最多，服务范围也最广泛。 AWS的著名客户是Netflix，LinkedIn，Facebook，百度和ESPN。
微软Azure (Microsoft Azure)
Launched in 2010 and is the second most common service for cloud computing. Microsoft alone takes up 20% market share (AWS takes up about 40%) and also offers many services, While AWS remains the most popular with user-friendly features, MS Azure is great for companies that are already committed to MS products and services as the data migration process to the cloud would be more seamless. Their notable clients are eBay, Boeing, Samsung, and BMW.
该服务于2010年推出，是第二大最常见的云计算服务。 仅微软一家就占据了20％的市场份额(AWS占据了约40％的份额)，并且还提供了许多服务。尽管AWS凭借其用户友好型功能仍然是最受欢迎的产品，但是MS Azure对于已经致力于MS产品和服务的公司来说非常适合到云的数据迁移过程将更加无缝。 他们著名的客户是eBay，波音，三星和宝马。
Google Cloud Platform (Google Cloud Platform)
Launched in 2011 with significantly fewer data centers globally compared to the other two, Google is catching up with 10% of market share. Initially, GCP was built to enhance Google’s own products, including Google Services and Youtube, but has since successfully expanded their service capability. GCP also has proven to have incredible processing speed compared to its two competitors. Many big companies also vouches for its potential, including HTC, Sony Music, Coca Cola, Evernote, and more.
Google成立于2011年，与全球其他两个数据中心相比，其数据中心数量明显减少，Google的市场份额已赶超10％。 最初，GCP旨在增强Google自身的产品，包括Google Services和Youtube，但此后成功扩展了其服务能力。 与两个竞争对手相比，GCP还被证明具有惊人的处理速度。 许多大公司还利用其潜力，包括HTC，索尼音乐，可口可乐，印象笔记等。
While there are many other cloud services for different types of users, many still prefer AWS for being the trailblazer with the expertise in cloud computing. But with the current trend of everything being moved to the cloud, more companies might be able to bring much surprises.
Either way, cloud computing is the future and it is just as essential to learn as Microsoft Excel a few decades ago. So I’d say that — yes, it is time to kickstart some Cloud computing crash course and this is just the beginning.
无论哪种方式，云计算都是未来，它与几十年前的Microsoft Excel一样重要。 所以我要说-是的，是时候启动一些云计算崩溃课程了，这仅仅是开始。