Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Category: Reddit MachineLearning

[P] Neural Network Model Builder & Visualiser | Netbrix.ml

[P] Neural Network Model Builder & Visualiser | Netbrix.ml

https://netbrix.ml/

Introduction

I recently discovered the Mithril.js JavaScript library and wanted a project to build up my skills with it! I ended up going with a simple web app for visualising and editing network models which I’ve named netbrix.ml. I’ve wanted to build something like this for a while since it seemed like a really good project to improve my web development skills and my understanding of the process of building deep learning models. After reading this post on /r/deeplearning where the writer gives insight into the modular nature of deep learning and gives the analogy of a deep learning ‘lego set’ it gave me the motivation to start work on this with that sort of vision in mind and I’ve now got a decent working web app!

I’m sure there are existing tools similar to this in existence, so I wanted to keep it as simple as possible and not try to over-engineer it. It’s meant to be easy and simple to use!

Importing Models

It has some cool features at the moment, like the ability to parse Sequential() model definitions in Keras e.g.

test_model = Sequential() test_model.add(Conv2D(64,(5,5), activation='relu')) test_model.add(MaxPooling2D((2,2))) test_model.add(Dropout(0.25)) 

(Alternatively you can paste in Keras JSON definitions from for example test_model.to_json())

Once that simple model definition is imported it will be parsed by the app to create the following neat visualisation:

https://i.redd.it/g60mc0qh2ll31.png

Editing Models

From here, you can make all the typical changes you would want to make to a model, including changing/adding layer parameters, adding new layers, changing the order of layers and removing layers, all without having to rely on Google to find the names of layers or their attributes. It can then be easily exported (or copied to the clipboard) with one click as either Python Keras code or a JSON spec which can be imported into Keras.

Building Models

It also has some nice features for building models from scratch, like the ability to add blocks of layers that come up in models frequently. Often it’s easy to forget the exact optimal order of layers for say a Convolutional block e.g. should pooling come before dropout or vice versa, and what about BatchNorm? Having preconfigured blocks of layers to choose from when building a model helps with this.

On top of that, just having an easily indexable list of layers is useful in itself.

Browsing Models

There is also a host of existing model architectures for a variety of machine learning tasks/datasets to explore and this is one of the most helpful features for me personally. Having an easy and centralised way to access a bunch of existing standard model architectures to take inspiration from is really useful! Rather than creating a model from scratch, you can find an existing model on the site, change say the input shape and a few of the hyperparameters and export it as working Keras code in just a few clicks. It’s also great for learning about the different architectures commonly used in building models.

Planned Features

  • Integration with TensorFlow.js to have some kind of in-browser training/prototyping
  • The ability to share your own model architectures
  • Support for variables. Currently you can specify meta tags for a model like ‘name’ and ‘source’. It would be cool to be able to specify variables in this same key/value fashion to allow even easier tuning.
  • Better mobile support. The site is relatively responsive at the moment, but not perfect. Hopefully you’re not building Keras models on your phone to begin with, but it will work if that’s something you wish to do 😁

So, I’d love to get some feedback on this project! Is it useful? Do you like the design? I’m open to criticism, this is primarily for me to learn 😁

Thanks for reading!

submitted by /u/DataSnaek
[link] [comments]

[D] Deep Learning rack server suggestions

Hello ML, my research lab just received a grant (25K €) for purchasing new hardware and we would like some recommendation so we can build the best deep learning rack. The team consists of 10-15 members. Our work involves different deep learning disciplines from images and videos to text or graphs.

We already have some servers with one or two GPU each shared between a few people and that worked out perfectly. Now we want to install a powerful rack with a setup similar to the DevBox spending all our budget. Our proposal, written one year ago:

Component Quantity
HPC 4u with space up to 8Gpus 1
Intel Skylake Xeon Gold Serie 6xxx o Power PC 9 2
32 Gb DDR4-2666 ECC REG 18
NVIDIA RTX 2080TI 4
Intel DC P4600 2TB NVMe PCIe 3.0 2
SATA3 2TB 6GB/s 7.200 RMP 1

Do you guys have any suggestion for a good/cost-efficient setup or some requirements that we must meet when configuring our rack server setup?

Thanks

submitted by /u/Gusinato95
[link] [comments]

[D] Forced alignment for low resource languages

So leave out Kaldi, Gentle, Montreal Fixed Alignment, etc. All those need detailed phonetic dictionaries etc which are only available for the most popular languages.

How does ML help if you can’t leverage ANY of the existing language toolsets? Does anyone have a good guess at possible ML solutions, can be semi supervised. Keep in mind when most of the work on forced alignment was done, it predates modern ML like deep neural networks etc.

Just want to hear your crazy ideas and hopefully I can take one and run with it. Am stumped right now with about 200 hours of unlabelled data with nothing to do with it except manual labelling which is soul crushing.

submitted by /u/sicp4lyfe
[link] [comments]

[D] 2nd Order Approximation in XGboost’s Objective Function

Hi all,

I have a quick question regarding XGboost’s objective function. I was reading the XGboost paper (https://arxiv.org/pdf/1603.02754.pdf). I see that authors approximated the original objective function using a 2nd order Taylor series (page 2, section 2.2). Is there a particular reason why it’s expanded to 2nd degree and not higher? I’m guessing that linear apprx. is not enough and higher orders require more computational power, but is there a mathematical background or is this a design choice?

submitted by /u/_kty
[link] [comments]

[Research] Rotated Mask RCNN

[Research] Rotated Mask RCNN

The Problem With MaskRCNN (and Bounding Boxes)

Due to bounding box ambiguity, Mask R-CNN fails in relatively dense scenes with objects of the same class, particularly if those objects have high bounding box overlap. In these scenes, both recall (due to NMS) and precision (foreground instance class ambiguity) are affected.

https://i.redd.it/gjnps7aunjl31.png

https://i.redd.it/vph52r9vnjl31.png

Rotated Mask R-CNN

Rotated Mask R-CNN resolves some of these issues by adopting a rotated bounding box representation. Extends Faster R-CNN, Mask R-CNN, or even RPN-only to work with rotated bounding boxes.

https://i.redd.it/mvv46t6snjl31.png

This work also builds on the Mask Scoring R-CNN (‘MS R-CNN’) paper by learning the quality of the predicted instance masks (maskscoring_rcnn).

Github Link: https://github.com/mrlooi/rotated_maskrcnn

submitted by /u/mrlooi
[link] [comments]