Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Category: Reddit MachineLearning

[R] why doesn’t this Ubuntu vm feel native (quad i7 @3.4 Ghz, 16 GB, Windows 10, nothing else running) in vmware?

Here is the image:

It’s a a fully configured deep learning VM. It’s about 7.7GB.

I am running on:

  • an i7 3770 quad core at 3.4 Ghz (not the i7 3770K which has no VT-d virtualization extensions – this one does.)

  • 16 GB of DDR 3 ram that is clocked at 1600 mhz. I benchmarked it at 19000 MB/sec in Novabench.

  • The GPU in the host is 1060 GPU with 6 GB of RAM – but the image file I linked above explains “Due to licensing and installation complications, there’s no GPU acceleration / CUDA support provided. So you don’t need an Nvidia GPU to try this out, but it also won’t take advantage of a GPU if you have one.”

  • windows 10.

  • I have an SSD. It’s not the fastest but the write speed is 214 MB/seconds and the read speed is 253 MB/seconds in Novabench.

  • I turned the swap off entirely on the host system and have plenty of RAM left.

Without anything else running on the host computer, which is a pretty fresh install, this VM doesn’t feel native at all when entering full screen, regardless if I give it 2 GB (default), 4 GB, or even 8 GB of memory. I turned on the virtualization extensions in my bios and in the vmware settings too! I went from giving 1 CPU to 2 CPU’s to the guest. None of this helps.

why deson’t it feel “native”? I mean things like moving the mouse around, opening a new firefox windows, etc. It’s “obviously” a VM.

Is it not supposed to feel native or buttery smooth? I’m just at a loss. Additionally, I did an update from within it and it updated Ubuntu files at 500 KB/second! My connection is several megabytes per second.

I feel like I must be missing something….but what??

submitted by /u/vmcuriouss
[link] [comments]

[R] Thoughts on Mutual Information: More Estimators and Formal Limitations

Mutual Information is an important measure of dependence between two variables and is often used in ML for various different reasons (e.g. representation learning). In the last year there were some new results and papers which made me curious about the topic and I wanted to share some of my thoughts.

In the first post I introduce a couple of new estimators based on bounds on the log marginal likelihood. Some of them are very appealing as they give us 1) both lower and upper bounds, 2) a way to make bounds tighter by putting more computation into it.

In the second post I move to the analysis of (some of) these estimators, as well as address the Formal Limitations paper – a paper that, loosely speaking, forbids good only-samples-based (blackbox) bounds on the Mutual Information. In particular, I show how this issue manifests itself in several widely used blackbox bounds, and then contrast it with bounds that use some distribution knowledge.

Discussion might not be quite introductory, so I recommend checking out a recent ICML paper On Variational Bounds of Mutual Information.

submitted by /u/asobolev
[link] [comments]

[P] Pose detection for Google Coral EdgeTPU

[P] Pose detection for Google Coral EdgeTPU

this is the demo for showing how to use the PoseNet model to detect human poses from images and video, such as where someone’s elbow, shoulder or foot appear in the image.

example code to shows how to run it on a camera stream.

code here: https://github.com/google-coral/project-posenet

https://i.redd.it/htrtp01zbkg31.gif

pppt: before you run this repo, you should get yourself a coral deb board or coral usb : suggest this site

submitted by /u/makereven
[link] [comments]

[P] Simple PyTorch implementation of Autoregressive Language Model on Wikipedia text

A step-by-step tutorial on how to implement and adapt Autoregressive language model to Wikipedia text.

A pre-trained BERT, XLNET is publicly available ! But, for NLP beginners, It could be hard to use/adapt after full understanding. For them, I covered whole, end-to-end implementation process for language modeling, using unidirectional/bidirectional LSTM network, we already know.

  • – do not use torchtext library !
  • + include trained model file, training logs

I hope that this repo can be a good solution for people who want to have their own language model 🙂

https://github.com/lyeoni/pretraining-for-language-understanding

submitted by /u/lyeoni
[link] [comments]

[N] Submissions now open for the NeurIPS 2019 MineRL Competition on Sample Efficient RL

[N] Submissions now open for the NeurIPS 2019 MineRL Competition on Sample Efficient RL

Hey friends!

Super excited to announce that after two months of bug squashing & adventures in the Minecraft code base, the MineRL competition is now open for submissions! The general blog post about how to get started is here:

http://minerl.io/blog/

Protocol for submission 🙂

Here’s how you submit in Round 1:

  1. Sign up to join the competition on the AIcrowd website.
  2. Clone the AIcrowd starter template and start developing your submissions.
  3. Submit an agent to the leaderboard:
  • Train your agents locally (or on Azure) in under 8,000,000 samples over 4 days. Participants should use hardware no more powerful than NG6v2 instances on Azure (6 CPU cores, 112 GiB RAM, 736 GiB SDD, and a NVIDIA P100 GPU.)
  • Push your repository to AIcrowd GitLab, which verifies that it can successfully be re-trained by the organizers at the end of Round 1 and then runs the test entrypoint to evaluate the trained agent’s performance!

Once the full evaluation of the uploaded model/code is done, the participant’s submission will appear on the leaderboard!

submitted by /u/MadcowD
[link] [comments]

[P] Music Transformer ( Huang et al, Google Brain, ICLR2019 ) Re-Implementation ( Tensorflow 2.0 )

Hi, r/MachineLearning !

As I finished implementation of Google’s MusicTransformer,

I want to share it to here.

  • library : Tensorflow2.0 (beta)

  • training env : v100 x 1GPU

paper : https://arxiv.org/abs/1809.04281

github : https://github.com/jason9693/MusicTransformer-tensorflow2.0

Google Magenta blog : https://magenta.tensorflow.org/music-transformer

For more details, you can see in README.md

Thank you 🙂

ps) A video below here is generated sample after listened “Boy with Love (BTS)”

https://reddit.com/link/cqk3to/video/mwnl0yfo0jg31/player

submitted by /u/LIKELIH00D
[link] [comments]

[D] how can I get a global minimum

[D] how can I get a global minimum

here is my code

for _ in range(10): K.clear_session() model = Sequential() model.add(LSTM(256, input_shape=(None, 1))) model.add(Dropout(0.2)) model.add(Dense(256)) model.add(Dropout(0.2)) model.add(Dense(1)) model.compile(loss='mean_squared_error', optimizer='adam') hist = model.fit(x_train, y_train, epochs=20, batch_size=64, verbose=0) p = model.predict(x_test) print(mean_squared_error(y_test, p)) plt.plot(y_test) plt.plot(p) plt.legend(['testY', 'p'], loc='upper right') plt.show() ... plt.plot(hist.history['loss']) 

`Total params` : 330,241

`samples` : 2264

and below is the result

https://i.redd.it/2gjgr2uonig31.png

I haven’t changed anything.

I only ran for loop.

As you can see in the picture, the result of the MSE is huge, even though I have just run the for loop.

I think the fundamental reason for this problem is that the optimizer can not find global maximum and find local maximum and converge.

The reason is that after checking all the loss graphs, the loss is no longer reduced significantly. (After 20 times)

So in order to solve this problem, I have to find the global minimum. How should I do this?

I tried adjusting the number of batch_size, epoch.

Also, I tried hidden layer size, LSTM unit, kerner_initializer addition, optimizer change, etc. but could not get any meaningful result.

I wonder how can I solve this problem.

Your valuable opinions and thoughts will be very much appreciated.

if you want to see full source here is link https://gist.github.com/Lay4U/e1fc7d036356575f4d0799cdcebed90e

submitted by /u/GoBacksIn
[link] [comments]

[P] Introduction to Optimization with Genetic Algorithm

Selection of the optimal parameters values for machine learning tasks is challenging. Some results may be bad not because the data is noisy or the used learning algorithm is weak, but due to the bad selection of the parameters values. This article gives a brief introduction about evolutionary algorithms (EAs) and describes genetic algorithm (GA) which is one of the simplest random-based EAs.

https://www.linkedin.com/pulse/introduction-optimization-genetic-algorithm-ahmed-gad

submitted by /u/ahmedfgad
[link] [comments]