# Coursera machine learning week 10 Quiz answers Large Scale | Andrew NG

In this article, you will find Coursera machine learning week 10 Quiz answers Large Scale. Use “Ctrl+F” To Find Any Questions or Answers. For Mobile Users, You Just Need To Click On Three dots In Your Browser & You Will Get A “Find” Option There. Use These Options to Get Any Random Questions Answer.
Try to solve all the assignments by yourself first, but if you get stuck somewhere then feel free to browse the code. Don’t just copy-paste the code for the sake of completion.  Even if you copy the code, make sure you understand the code first.

### Coursera machine learning week 10 Quiz answers Large Scale| Andrew NG

1. Suppose you are training a logistic regression classifier using stochastic gradient descent. You find that the cost (say, $inline&space;cost(theta,(x^{(i)},y^{(i)}))$, averaged over the last 500 examples), plotted as a function of the number of iterations, is slowly increasing over time. Which of the following changes are likely to help?

•  Try using a smaller learning rate α.
•  Try averaging the cost over a larger number of examples (say 1000 examples instead of 500) in the plot.
•  This is not an issue, as we expect this to occur with stochastic gradient descent.
•  Try using a larger learning rate α.
•  Use fewer examples from your training set.
•  Try halving (decreasing) the learning rate α, and see if that causes the cost to now consistently go down; and if not, keep halving it until it does.
•  This is not possible with stochastic gradient descent, as it is guaranteed to converge to the optimal parameters θ.
•  Try averaging the cost over a smaller number of examples (say 250 examples instead of 500) in the plot.

2. Which of the following statements about stochastic gradient descent are true?
Check all that apply.

•  One of the advantages of stochastic gradient descent is that it can start progress in improving the parameters θ after looking at just a single training example; in contrast, batch gradient descent needs to take a pass over the entire training set before it starts to make progress in improving the parameters’ values.
•  Stochastic gradient descent is particularly well suited to problems with small training set sizes; in these problems, stochastic gradient descent is often preferred to batch gradient descent.
•  In each iteration of stochastic gradient descent, the algorithm needs to examine/use only one training example.
•  Before running stochastic gradient descent, you should randomly shuffle (reorder) the training set.
•  If you have a huge training set, then stochastic gradient descent may be much faster than batch gradient descent.

3. Which of the following statements about online learning are true? Check all that apply.

•  One of the disadvantages of online learning is that it requires a large amount of computer memory/disk space to store all the training examples we have seen.
•  In the approach to online learning discussed in the lecture video, we repeatedly get a single training example, take one step of stochastic gradient descent using that example, and then move on to the next example.
•  One of the advantages of online learning is that there is no need to pick a learning rate α.
•  When using online learning, in each step we get a new example (x, y), perform one step of (essentially stochastic gradient descent) learning on that example, and then discard that example and move on to the next.
•  When using online learning, you must save every new training example you get, as you will need to reuse past examples to re-train the model even after you get new training examples in the future.
•  Online learning algorithms are most appropriate when we have a fixed training set of size m that we want to train on.
•  One of the advantages of online learning is that if the function we’re modeling changes over time (such as if we are modeling the probability of users clicking on different URLs, and user tastes/preferences are changing over time), the online learning algorithm will automatically adapt to these changes.
•  Online learning algorithms are usually best suited to problems were we have a continuous/non-stop stream of data that we want to learn from.

4. Assuming that you have a very large training set, which of the following algorithms do you think can be parallelized using map-reduce and splitting the training set across different machines? Check all that apply.

•  A neural network trained using batch gradient descent.
•  Linear regression trained using batch gradient descent.
•  An online learning setting, where you repeatedly get a single example (x, y), and want to learn from that single example before moving on.
•  Logistic regression trained using stochastic gradient descent.
•  Logistic regression trained using batch gradient descent.
•  Logistic regression trained using stochastic gradient descent.
•  Linear regression trained using stochastic gradient descent.

5. Which of the following statements about map-reduce are true? Check all that apply.

•  When using map-reduce with gradient descent, we usually use a single machine that accumulates the gradients from each of the map-reduce machines, in order to compute the parameter update for that iteration.
•  Because of network latency and other overhead associated with map-reduce, if we run map-reduce using N computers, we might get less than an N-fold speedup compared to using 1 computer.
•  If you have only 1 computer with 1 computing core, then map-reduce is unlikely to help.
•  If we run map-reduce using N computers, then we will always get at least an N-fold speedup compared to using 1 computer.
•  Running map-reduce over N computers requires that we split the training set into  pieces.
•  In order to parallelize a learning algorithm using map-reduce, the first step is to figure out how to express the main work done by the algorithm as computing sums of functions of training examples.

Disclaimer:  Hopefully, this article will be useful for you to find all the Coursera machine learning week 10 Quiz answers Large Scale and grab some premium knowledge with less effort.
Finally, we are now, in the end, I just want to conclude some important message for you, Feel free to ask doubts in the comment section. I will try my best to answer it. If you find this helpful by any means like, comment, and share the post. Please share our posts on social media platforms and also suggest to your friends to Join Our Groups. Don’t forget to subscribe. This is the simplest way to encourage me to keep doing such work.

### FAQs

Is Andrew Ng’s Machine Learning course good?
It is the Best Course for Supervised Machine Learning! Andrew Ng Sir has been like always has such important & difficult concepts of Supervised ML with such ease and great examples, Just amazing!
How do I get answers to coursera assignment?
Use “Ctrl+F” To Find Any Questions Answered. & For Mobile Users, You Just Need To Click On Three dots In Your Browser & You Will Get A “Find” Option There. Use These Options to Get Any Random Questions Answer.
How long does it take to finish coursera Machine Learning?
this specialization requires approximately 3 months with 75 hours of materials to complete, and I finished it in 3 weeks and spent an additional 1 week reviewing the whole course.
How do you submit assignments on Coursera Machine Learning?
Submit a programming assignment Open the assignment page for the assignment you want to submit. Read the assignment instructions and download any starter files. Finish the coding tasks in your local coding environment. Check the starter files and instructions when you need to. Reference

Sharing Is Caring