|
||
When I first started learning on Machine Learning and reading a lot of documents, Watching tons of YouTubes.. a question that pops up in my mind was
I think it is natural that you get this kind of questions whenever you are trying to get into an area that is in a kind of hype and everybody is talking about it. One way to get a relatively clear view (or anwser) to this kind of question would be to widen your scope of study a little bit further and try to study the whole flow of the technology being utilized in real world rather than sticking to the software tools or algorithms etc.
Overall Machine Learning PipelineIn my view, overall flow of Machine Learning can be illustrated as follows. The first step (1) is to collect the data from real life activity. The type of data to be collected would be different depending on which company you are working for. In most cases, it is highly likely that the data is already there in the organization and the organization decided to apply Machine Learning in the hope that they can get more useful information from the data. And based on the data and the business requirement, you may need to think up of what kind of specific algorithm to apply and what should be the input and output for the algorithm. Once the algorithm is determined, you would need to process the data in such a format that can be fed into the algorithm you want to execute. If I breakdown the three major process labeled above and add a few more steps which comes not from the technology but from business point of view, the list can be extended as follows. (1) Determine What is the outcome you want to achieve ? (2) Determine What is the business impact ? (3) Determine What kind of data you need to collect in real life ? (4) Determine What kind of Algorithm you want to choose ? (5) Figure out How do you implement the algorithm in a certain tools (e.g, TensorFlow, Pytorch etc) (6) Can you justify on why you chose the algorithm ? (7) Figure out How to do process the raw data in such a way that it fits the input of your algorithm ? (8) In some rare case, you don't find any proper algorithm among the existing ones that completely fits your purpose of analysis. In this case, you may need to come up with new algorithm. I think most of the tutorials and tech blogs / videos are focused on Algorithm part. That is, mostly about item (4),(5) in the following list. But in reality, there are many other steps are involved in the overall data flow as listed below. I am not saying every engineers in the area of Machine Learning but I think I can say that it would be rare case that you will be working only in (4) or (5) unless you are working on inventing new algorithm in Academia. Even though your major job is with (4),(5) in your workplace you would need to do at least a few other parts as well. From reviewing many of real life use cases and interview video from those working in various area applying machine learning, step (7) is one of the largest portions of what many of Machine Learning engineers have to do in their real work even though there are not so many people who really enjoys it. This is the reality for most of the engineering job. Before you jump into the area and doing the real job, everything may look fancy. But once you get yourself into the job, you may find most of the task given to you may be those that you've never expected before and you don't like much. Common Data-PreprocessFollowings are some common examples of what type of data processing you would need to perform before you put the origian data (i.e, data that you collected) to a specific machine learning model that you want to use. It doesn't mean that you always need to do all of these data processing to all of your model. Depending on the machine learning model and the format of raw data, the type of preprocessing tasks would vary. I am just trying to make a list of common/frequent form of pre processing. If you are not so familiar with computer programming language and try to know where I should start in terms of computer programming on machine learning or data science, I would suggest you to pick up a specific language that you like and make a lot of practice for writing programs to do this kind of tasks listed below. By donig that, I think you can learn programming and machine learning at the same time. Examples of What they do (Applications / Use Cases)In this section, I would try to list of videos based on what big players in the AI/ML industry has been doing. I am listing the videos mostly from those presentations directly presented by specific companies that are listed. This is a little bit on purpose. I think the presentation directly coming from the company would best describe on the big picture of what they are doing and what they 'intend to' do. Also I am trying to list those presentations showing the various types of input and output to the machine learning system. Since most of the list are about big picture or business model, they do not carry the much technical details. For more technical issues, big trends in terms of technology (not in terms of buisiness), various courses, I am listing in another page here.
Machine Learning at Google
Machine Learning at Facebook
Machine Learning at Microsoft
Machine Learning at Amazon
Machine Learning at Apple
Machine Learning at Cisco
Machine Learning at AutoDesk
Machine Learning at Ericsson
Machine Learning at Verizon
Machine Learning at Qualcomm
Semantic Segmentation of ImageThis explains very intuitively on what is Semantic Segmentation, what it is used for and how to prepare the labeled data for training. Predictive MaintenanceThis shows a good example of how to define a meaningful feagures from various sensors in a pump system and how to process those data that fits Machine Learning Algorithm.
Natural Language ProcessingThis use case shows a case where the system takes in customer support message given in the form of natural language (e.g, text message) and analyze it, suggest possible root causes, treatment. In this presentation, you would learn not only learn about an application of machine learning but also on how to justify this application in terms of business. Churn PredictionThis use case shows the case to predict whether a customer would change the carrier (the carrier he/she subscribe) from the given set of customer history data.
Fraud DetectionThis use case shows an example on how to detect Fraud from a given dateset of historic data of card holders. Forecasting Airline PassengerAs flight booking goes one, passengers book a flight but some passengers cancel the booking. How to predict the booking rate for a flight ? Networking
Wireless Communication / PHY , Transport Channel
Machine Learning for 5G
Machine Learning for Smart Phone
Machine Learning with MatlabchatGPT and others
|
||