-
Notifications
You must be signed in to change notification settings - Fork 6
/
Copy path2020.07.14.txt
1673 lines (1373 loc) · 132 KB
/
2020.07.14.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
==========New Papers==========
1, TITLE: Probabilistic Jacobian-based Saliency Maps Attacks
http://arxiv.org/abs/2007.06032
AUTHORS: António Loison ; Théo Combey ; Hatem Hajri
COMMENTS: 21 pages
HIGHLIGHT: In this paper, we introduce Weighted JSMA (WJSMA) and Taylor JSMA (TJSMA), simple, faster and more efficient versions of JSMA.
2, TITLE: Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention
http://arxiv.org/abs/2007.06486
AUTHORS: Emir Demirel ; Sven Ahlback ; Simon Dixon
HIGHLIGHT: This paper proposes a complete pipeline for this task which may commonly be referred as automatic lyrics transcription (ALT).
3, TITLE: Multi-Modality Information Fusion for Radiomics-based Neural Architecture Search
http://arxiv.org/abs/2007.06002
AUTHORS: Yige Peng ; Lei Bi ; Michael Fulham ; Dagan Feng ; Jinman Kim
COMMENTS: Accepted by MICCAI 2020
HIGHLIGHT: In this study, we propose a multi-modality neural architecture search method (MM-NAS) to automatically derive optimal multi-modality image features for radiomics and thus negate the dependence on a manual process.
4, TITLE: Usefulness of interpretability methods to explain deep learning based plant stress phenotyping
http://arxiv.org/abs/2007.05729
AUTHORS: Koushik Nagasubramanian ; Asheesh K. Singh ; Arti Singh ; Soumik Sarkar ; Baskar Ganapathysubramanian
COMMENTS: 15 pages, 6 figures
HIGHLIGHT: In this work, we compare some of the most popular interpretability methods: Saliency Maps, SmoothGrad, Guided Backpropogation, Deep Taylor Decomposition, Integrated Gradients, Layer-wise Relevance Propagation and Gradient times Input, for interpreting the deep learning model.
5, TITLE: TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
http://arxiv.org/abs/2007.06028
AUTHORS: Andy T. Liu ; Shang-Wen Li ; Hung-yi Lee
COMMENTS: Submitted to IEEE TASLP, currently under review
HIGHLIGHT: We introduce a self-supervised speech pre-training method called TERA, which stands for Transformer Encoder Representations from Alteration.
6, TITLE: Accelerating Translational Image Registration for HDR Images on GPU
http://arxiv.org/abs/2007.06483
AUTHORS: Kadir Cenk Alpay ; Kadir Berkay Aydemir ; Alptekin Temizel
COMMENTS: Submitted for Consideration for Publication in High Performance Computing Conference 2020
HIGHLIGHT: In this study, we optimize these computations using a parallel processing approach utilizing GPU.
7, TITLE: MeDaS: An open-source platform as service to help break the walls between medicine and informatics
http://arxiv.org/abs/2007.06013
AUTHORS: Liang Zhang ; Johann Li ; Ping Li ; Xiaoyuan Lu ; Peiyi Shen ; Guangming Zhu ; Syed Afaq Shah ; Mohammed Bennarmoun ; Kun Qian ; Björn W. Schuller
HIGHLIGHT: Based on a series of toolkits and utilities from the idea of RINV (Rapid Implementation aNd Verification), our proposed MeDaS platform can implement pre-processing, post-processing, augmentation, visualization, and other phases needed in medical image analysis.
8, TITLE: HSD Shared Task in VLSP Campaign 2019:Hate Speech Detection for Social Good
http://arxiv.org/abs/2007.06493
AUTHORS: Xuan-Son Vu ; Thanh Vu ; Mai-Vu Tran ; Thanh Le-Cong ; Huyen T M. Nguyen
HIGHLIGHT: The paper describes the organisation of the "HateSpeech Detection" (HSD) task at the VLSP workshop 2019 on detecting the fine-grained presence of hate speech in Vietnamese textual items (i.e., messages) extracted from Facebook, which is the most popular social network site (SNS) in Vietnam.
9, TITLE: Distributed Graph Convolutional Networks
http://arxiv.org/abs/2007.06281
AUTHORS: Simone Scardapane ; Indro Spinelli ; Paolo Di Lorenzo
COMMENTS: Preprint submitted to IEEE TSIPN
HIGHLIGHT: The aim of this work is to develop a fully-distributed algorithmic framework for training graph convolutional networks (GCNs).
10, TITLE: Strengthening neighbourhood substitution
http://arxiv.org/abs/2007.06282
AUTHORS: Martin C. Cooper
HIGHLIGHT: We show that the notion of neighbourhood substitution can be strengthened in two distinct ways without increasing time complexity.
11, TITLE: Term Revealing: Furthering Quantization at Run Time on Quantized DNNs
http://arxiv.org/abs/2007.06389
AUTHORS: H. T. Kung ; Bradley McDanel ; Sai Qian Zhang
COMMENTS: 13 pages, 19 figures, 4 tables, To appear in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020
HIGHLIGHT: We present a novel technique, called Term Revealing (TR), for furthering quantization at run time for improved performance of Deep Neural Networks (DNNs) already quantized with conventional quantization methods.
12, TITLE: A simple defense against adversarial attacks on heatmap explanations
http://arxiv.org/abs/2007.06381
AUTHORS: Laura Rieger ; Lars Kai Hansen
COMMENTS: Accepted at 2020 Workshop on Human Interpretability in Machine Learning (WHI)
HIGHLIGHT: In our work we present an effective defence against such adversarial attacks on neural networks.
13, TITLE: Knowledge Graph Driven Approach to Represent Video Streams for Spatiotemporal Event Pattern Matching in Complex Event Processing
http://arxiv.org/abs/2007.06292
AUTHORS: Piyush Yadav ; Dhaval Salwala ; Edward Curry
COMMENTS: 31 pages, 14 Figures, Publication accepted in International Journal of Graph Computing
HIGHLIGHT: This work introduces a graph-based structure for continuous evolving video streams, which enables the CEP system to query complex video event patterns.
14, TITLE: Seeing Eye-to-Eye? A Comparison of Object Recognition Performance in Humans and Deep Convolutional Neural Networks under Image Manipulation
http://arxiv.org/abs/2007.06294
AUTHORS: Leonard E. van Dyck ; Walter R. Gruber
COMMENTS: 19 pages, 7 figures, 3 tables
HIGHLIGHT: This study aims towards a behavioral comparison of visual core object recognition between humans and feedforward neural networks in a classification learning paradigm on an ImageNet data set.
15, TITLE: Paranoid Transformer: Reading Narrative of Madness as Computational Approach to Creativity
http://arxiv.org/abs/2007.06290
AUTHORS: Yana Agafonova ; Alexey Tikhonov ; Ivan P. Yamshchikov
HIGHLIGHT: It presents a case study of a Paranoid Transformer - a fully autonomous text generation engine with raw output that could be read as the narrative of a mad digital persona without any additional human post-filtering.
16, TITLE: DeepHAZMAT: Hazardous Materials Sign Detection and Segmentation with Restricted Computational Resources
http://arxiv.org/abs/2007.06392
AUTHORS: Amir Sharifi ; Ahmadreza Zibaei ; Mahdi Rezaei
HIGHLIGHT: In this paper, we propose a CNN-Based pipeline called DeepHAZMAT for detecting and segmenting Hazmats in four steps; 1) optimising the number of input images that are fed into the CNN network, 2) using the YOLOv3-tiny structure to collect the required visual information from the hazardous areas, 3) Hazmat sign segmentation and separation from the background using GrabCut technique, and 4) post-processing the result with morphological operators and convex hall algorithm.
17, TITLE: A Feature Analysis for Multimodal News Retrieval
http://arxiv.org/abs/2007.06390
AUTHORS: Golsa Tahmasebzadeh ; Sherzod Hakimov ; Eric Müller-Budack ; Ralph Ewerth
HIGHLIGHT: In this paper, we investigate the usefulness of multimodal features for cross-lingual news search in various domains: politics, health, environment, sport, and finance.
18, TITLE: BoxE: A Box Embedding Model for Knowledge Base Completion
http://arxiv.org/abs/2007.06267
AUTHORS: Ralph Abboud ; İsmail İlkan Ceylan ; Thomas Lukasiewicz ; Tommaso Salvatori
HIGHLIGHT: Here, we propose a spatio-translational embedding model, called BoxE, that simultaneously addresses all these limitations.
19, TITLE: Contextual Bandit with Missing Rewards
http://arxiv.org/abs/2007.06368
AUTHORS: Djallel Bouneffouf ; Sohini Upadhyay ; Yasaman Khazaeni
HIGHLIGHT: In order to address the missing rewards setting, we propose to combine the standard contextual bandit approach with an unsupervised learning mechanism such as clustering.
20, TITLE: OpenStreetMap: Challenges and Opportunities in Machine Learning and Remote Sensing
http://arxiv.org/abs/2007.06277
AUTHORS: John Vargas ; Shivangi Srivastava ; Devis Tuia ; Alexandre Falcao
HIGHLIGHT: In this work, we present a review of recent methods based on machine learning to improve and use OSM data.
21, TITLE: On uncertainty estimation in active learning for image segmentation
http://arxiv.org/abs/2007.06364
AUTHORS: Bo Li ; Tommy Sonne Alstrøm
COMMENTS: Presented at ICML 2020 Workshop on Uncertainty & Robustness in Deep Learning
HIGHLIGHT: In this paper, we explore uncertainty calibration within an active learning framework for medical image segmentation, an area where labels often are scarce.
22, TITLE: Dual-Teacher: Integrating Intra-domain and Inter-domain Teachers for Annotation-efficient Cardiac Segmentation
http://arxiv.org/abs/2007.06279
AUTHORS: Kang Li ; Shujun Wang ; Lequan Yu ; Pheng-Ann Heng
COMMENTS: Accepted at MICCAI 2020
HIGHLIGHT: In this paper, we aim to investigate the feasibility of simultaneously leveraging abundant unlabeled data and well-established cross-modality data for annotation-efficient medical image segmentation.
23, TITLE: RNA-2QCFA: Evolving Two-way Quantum Finite Automata with Classical States for RNA Secondary Structures
http://arxiv.org/abs/2007.06273
AUTHORS: Amandeep Singh Bhatia ; Shenggen Zheng
COMMENTS: 9 pages, 3 figures
HIGHLIGHT: The main objective of this paper is on using two-way quantum finite automata with classical states to simulate, model and analyze the ribonucleic acid (RNA) sequences.
24, TITLE: RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning
http://arxiv.org/abs/2007.06271
AUTHORS: Riccardo Del Chiaro ; Bartłomiej Twardowski ; Andrew D. Bagdanov ; Joost van de Weijer
COMMENTS: 8 pages, 5 figures, to be published in LifelongML workshop at ICML2020
HIGHLIGHT: In this paper we take a systematic look at continual learning of LSTM-based models for image captioning.
25, TITLE: Screen Tracking for Clinical Translation of Live Ultrasound Image Analysis Methods
http://arxiv.org/abs/2007.06272
AUTHORS: Simona Treivase ; Alberto Gomez ; Jacqueline Matthew ; Emily Skelton ; Julia A. Schnabel ; Nicolas Toussaint
HIGHLIGHT: We propose a generic framework to extract the US images and superimpose the results of an analysis task, without any need for physical connection or alteration to the US system.
26, TITLE: Smart technology in the classroom: a systematic review.Prospects for algorithmic accountability
http://arxiv.org/abs/2007.06374
AUTHORS: Arian Garshi ; Malin Wist Jakobsen ; Jørgen Nyborg-Christensen ; Daniel Ostnes ; Maria Ovchinnikova
HIGHLIGHT: Based on our insights we propose a framework to effectively identify accountability for smart technology in education.
27, TITLE: Symmetric Dilated Convolution for Surgical Gesture Recognition
http://arxiv.org/abs/2007.06373
AUTHORS: Jinglu Zhang ; Yinyu Nie ; Yao Lyu ; Hailin Li ; Jian Chang ; Xiaosong Yang ; Jian Jun Zhang
COMMENTS: Accepted to MICCAI 2020
HIGHLIGHT: To tackle these challenges, we propose a novel temporal convolutional architecture to automatically detect and segment surgical gestures with corresponding boundaries only using RGB videos.
28, TITLE: Expert Training: Task Hardness Aware Meta-Learning for Few-Shot Classification
http://arxiv.org/abs/2007.06240
AUTHORS: Yucan Zhou ; Yu Wang ; Jianfei Cai ; Yu Zhou ; Qinghua Hu ; Weiping Wang
COMMENTS: 9 pages, 6 figures
HIGHLIGHT: Inspired by this idea, we propose an easy-to-hard expert meta-training strategy to arrange the training tasks properly, where easy tasks are preferred in the first phase, then, hard tasks are emphasized in the second phase.
29, TITLE: Transformer with Depth-Wise LSTM
http://arxiv.org/abs/2007.06257
AUTHORS: Hongfei Xu ; Qiuhui Liu ; Deyi Xiong ; Josef van Genabith
HIGHLIGHT: In this paper, we suggest that the residual connection has its drawbacks, and propose to train Transformers with the depth-wise LSTM which regards outputs of layers as steps in time series instead of residual connections, under the motivation that the vanishing gradient problem suffered by deep networks is the same as recurrent networks applied to long sequences, while LSTM (Hochreiter and Schmidhuber, 1997) has been proven of good capability in capturing long-distance relationship, and its design may alleviate some drawbacks of residual connections while ensuring the convergence.
30, TITLE: A theory of interaction semantics
http://arxiv.org/abs/2007.06258
AUTHORS: Johannes Reich
HIGHLIGHT: The aim of this article is to delineate a theory of interaction semantics and thereby provide a proper understanding of the "meaning" of the exchanged characters within an interaction.
31, TITLE: Exploring the Evolution of GANs through Quality Diversity
http://arxiv.org/abs/2007.06251
AUTHORS: Victor Costa ; Nuno Lourenço ; João Correia ; Penousal Machado
COMMENTS: Published in GECCO 2020
HIGHLIGHT: We propose in this paper the application of a quality-diversity algorithm in the evolution of GANs.
32, TITLE: Locality Guided Neural Networks for Explainable Artificial Intelligence
http://arxiv.org/abs/2007.06131
AUTHORS: Randy Tan ; Naimul Khan ; Ling Guan
COMMENTS: 8 pages, 3 figures, submitted to WCCI2020
HIGHLIGHT: In this paper, we propose a novel algorithm for back propagation, called Locality Guided Neural Network(LGNN) for training networks that preserves locality between neighbouring neurons within each layer of a deep network.
33, TITLE: DRWR: A Differentiable Renderer without Rendering for Unsupervised 3D Structure Learning from Silhouette Images
http://arxiv.org/abs/2007.06127
AUTHORS: Zhizhong Han ; Chao Chen ; Yu-Shen Liu ; Matthias Zwicker
COMMENTS: Accepted at ICML2020
HIGHLIGHT: In contrast, here we propose a Differentiable Renderer Without Rendering (DRWR) that omits these steps.
34, TITLE: SkyScapes -- Fine-Grained Semantic Understanding of Aerial Scenes
http://arxiv.org/abs/2007.06102
AUTHORS: Seyed Majid Azimi ; Corentin Henry ; Lars Sommer ; Arne Schumann ; Eleonora Vig
COMMENTS: Accepted in IEEE ICCV19
HIGHLIGHT: We therefore propose a novel multi-task model, which incorporates semantic edge detection and is better tuned for feature extraction from a wide range of scales.
35, TITLE: VINNAS: Variational Inference-based Neural Network Architecture Search
http://arxiv.org/abs/2007.06103
AUTHORS: Martin Ferianc ; Hongxiang Fan ; Miguel Rodrigues
COMMENTS: Submitted to ICPR'20 https://github.com/iiml-ucl/vinnas
HIGHLIGHT: To address these defects, we present a differentiable variational inference-based NAS method for searching sparse convolutional neural networks.
36, TITLE: Neural disambiguation of lemma and part of speech in morphologically rich languages
http://arxiv.org/abs/2007.06104
AUTHORS: José María Hoya Quecedo ; Maximilian W. Koppatz ; Giacomo Furlan ; Roman Yangarber
COMMENTS: This paper contains corrigenda to a previously published paper (Hoya Quecedo et al., 2020). It corrects a mistake in the original evaluation setup, and the results reported in Section 6., in Tables 5, 6, and 7
HIGHLIGHT: We propose a method for disambiguating ambiguous words in context, using a large un-annotated corpus of text, and a morphological analyser -- with no manual disambiguation or data annotation.
37, TITLE: Tabletop Roleplaying Games as Procedural Content Generators
http://arxiv.org/abs/2007.06108
AUTHORS: Matthew Guzdial ; Devi Acharya ; Max Kreminski ; Michael Cook ; Mirjam Eladhari ; Antonios Liapis ; Anne Sullivan
COMMENTS: 9 pages, 2 figures, FDG Workshop on Procedural Content Generation 2020
HIGHLIGHT: In this paper, we argue that TTRPG design can usefully be viewed as procedural content generator design.
38, TITLE: Graph Structure of Neural Networks
http://arxiv.org/abs/2007.06559
AUTHORS: Jiaxuan You ; Jure Leskovec ; Kaiming He ; Saining Xie
COMMENTS: ICML 2020
HIGHLIGHT: Here we systematically investigate how does the graph structure of neural networks affect their predictive performance.
39, TITLE: Long-Term Planning with Deep Reinforcement Learning on Autonomous Drones
http://arxiv.org/abs/2007.05694
AUTHORS: Ugurkan Ates
COMMENTS: Submitted to Association for the Advancement of Artificial Intelligence(AAAI) 2020 Fall Symposium Series
HIGHLIGHT: In this paper, we study a long-term planning scenario that is based on drone racing competitions held in real life.
40, TITLE: Free-running SIMilarity-Based Angiography (SIMBA) for simplified anatomical MR imaging of the heart
http://arxiv.org/abs/2007.06544
AUTHORS: John Heerfordt ; Kevin K. Whitehead ; Jessica A. M. Bastiaansen ; Lorenzo Di Sopra ; Christopher W. Roy ; Jérôme Yerly ; Bastien Milani ; Mark A. Fogel ; Matthias Stuber ; Davide Piccini
COMMENTS: 8 figures, 2 tables
HIGHLIGHT: We propose a novel fast reconstruction algorithm, applicable to ungated free-running sequences, that leverages inherent similarities in the acquired data to avoid such physiological constraints.
41, TITLE: Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching
http://arxiv.org/abs/2007.05687
AUTHORS: Xuhua Huang ; Jiarui Xu ; Yu-Wing Tai ; Chi-Keung Tang
COMMENTS: CVPR2020
HIGHLIGHT: In this paper, we introduce "tracking-by-detection" into VOS which can coherently integrate segmentation into tracking, by proposing a new temporal aggregation network and a novel dynamic time-evolving template matching mechanism to achieve significantly improved performance.
42, TITLE: Semi-steady-state Jaya Algorithm
http://arxiv.org/abs/2007.06463
AUTHORS: Uday K. Chakraborty
HIGHLIGHT: The present paper proposes a new, improved Jaya algorithm by modifying the update strategies of the best and the worst members in the population.
43, TITLE: Automatic Pass Annotation from Soccer VideoStreams Based on Object Detection and LSTM
http://arxiv.org/abs/2007.06475
AUTHORS: Danilo Sorano ; Fabio Carrara ; Paolo Cintia ; Fabrizio Falchi ; Luca Pappalardo
HIGHLIGHT: In this paper, we describe PassNet, a method to recognize the most frequent events in soccer, i.e., passes, from video streams.
44, TITLE: Learning Reasoning Strategies in End-to-End Differentiable Proving
http://arxiv.org/abs/2007.06477
AUTHORS: Pasquale Minervini ; Sebastian Riedel ; Pontus Stenetorp ; Edward Grefenstette ; Tim Rocktäschel
COMMENTS: Proceedings of the 37th International Conference on Machine Learning (ICML 2020)
HIGHLIGHT: We present Conditional Theorem Provers (CTPs), an extension to NTPs that learns an optimal rule selection strategy via gradient-based optimisation.
45, TITLE: Implicit Euler ODE Networks for Single-Image Dehazing
http://arxiv.org/abs/2007.06443
AUTHORS: Jiawei Shen ; Zhuoyan Li ; Lei Yu ; Gui-Song Xia ; Wen Yang
COMMENTS: 10pages, 10 figures, "for the associate project, see https://github.com/Jiawei-Shen?tab=repositories", submitted to CVPR workshop "vision for four seasons",
HIGHLIGHT: In this paper, we extend the explicit forward approximation to the implicit backward counterpart, which can be realized via a recursive neural network, named IM-block.
46, TITLE: Attention-guided Quality Assessment for Automated Cryo-EM Grid Screening
http://arxiv.org/abs/2007.05593
AUTHORS: Hong Xu ; Shireen Y. Elhabian ; David E. Timm
COMMENTS: Accepted for publication in MICCAI 2020, the 23rd International Conference on Medical Image Computing and Computer Assisted Intervention
HIGHLIGHT: Here, we focus on automating the early decision making for the microscope operator, scoring low magnification images of squares, and proposing the first deep learning framework, XCryoNet, for automated cryo-EM grid screening.
47, TITLE: EMIXER: End-to-end Multimodal X-ray Generation via Self-supervision
http://arxiv.org/abs/2007.05597
AUTHORS: Siddharth Biswal ; Peiye Zhuang ; Ayis Pyrros ; Nasir Siddiqui ; Sanmi Koyejo ; Jimeng Sun
HIGHLIGHT: To tackle this joint synthesis challenge, we propose an End-to-end MultImodal X-ray genERative model (EMIXER) for jointly synthesizing x-ray images and corresponding free-text reports, all conditional on diagnosis labels.
48, TITLE: Whitening for Self-Supervised Representation Learning
http://arxiv.org/abs/2007.06346
AUTHORS: Aleksandr Ermolov ; Aliaksandr Siarohin ; Enver Sangineto ; Nicu Sebe
HIGHLIGHT: In this paper we propose a different direction and a new loss function for self-supervised learning which is based on the whitening of the latent-space features.
49, TITLE: End-to-End Multi-Object Tracking with Global Response Map
http://arxiv.org/abs/2007.06344
AUTHORS: Xingyu Wan ; Jiakai Cao ; Sanping Zhou ; Jinjun Wang
HIGHLIGHT: To address the problem, we present a completely end-to-end approach that takes image-sequence/video as input and outputs directly the located and tracked objects of learned types.
50, TITLE: Learning and Exploiting Interclass Visual Correlations for Medical Image Classification
http://arxiv.org/abs/2007.06371
AUTHORS: Dong Wei ; Shilei Cao ; Kai Ma ; Yefeng Zheng
HIGHLIGHT: In this paper, we present the Class-Correlation Learning Network (CCL-Net) to learn interclass visual correlations from given training data, and produce soft labels to help with classification tasks.
51, TITLE: DeU-Net: Deformable U-Net for 3D Cardiac MRI Video Segmentation
http://arxiv.org/abs/2007.06341
AUTHORS: Shunjie Dong ; Jinlong Zhao ; Maojun Zhang ; Zhengxue Shi ; Jianing Deng ; Yiyu Shi ; Mei Tian ; Cheng Zhuo
HIGHLIGHT: In this paper, we propose a novel Deformable U-Net (DeU-Net) to fully exploit spatio-temporal information from 3D cardiac MRI video, including a Temporal Deformable Aggregation Module (TDAM) and a Deformable Global Position Attention (DGPA) network.
52, TITLE: A Label Attention Model for ICD Coding from Clinical Text
http://arxiv.org/abs/2007.06351
AUTHORS: Thanh Vu ; Dat Quoc Nguyen ; Anthony Nguyen
COMMENTS: In Proceedings of IJCAI 2020 (Main Track)
HIGHLIGHT: In this paper, we propose a new label attention model for automatic ICD coding, which can handle both the various lengths and the interdependence of the ICD code related text fragments.
53, TITLE: Disentanglement of Color and Shape Representations for Continual Learning
http://arxiv.org/abs/2007.06356
AUTHORS: David Berga ; Marc Masana ; Joost Van de Weijer
COMMENTS: Accepted at CL-ICML 2020
HIGHLIGHT: We combine our method with Elastic Weight Consolidation, Learning without Forgetting, Synaptic Intelligence and Memory Aware Synapses, and show that feature disentanglement positively impacts continual learning performance.
54, TITLE: Multiple Sound Sources Localization from Coarse to Fine
http://arxiv.org/abs/2007.06355
AUTHORS: Rui Qian ; Di Hu ; Heinrich Dinkel ; Mengyue Wu ; Ning Xu ; Weiyao Lin
COMMENTS: to appear in ECCV 2020
HIGHLIGHT: To solve this problem, we develop a two-stage audiovisual learning framework that disentangles audio and visual representations of different categories from complex scenes, then performs cross-modal feature alignment in a coarse-to-fine manner.
55, TITLE: ProtTrans: Towards Cracking the Language of Life's Code Through Self-Supervised Deep Learning and High Performance Computing
http://arxiv.org/abs/2007.06225
AUTHORS: Ahmed Elnaggar ; Michael Heinzinger ; Christian Dallago ; Ghalia Rihawi ; Yu Wang ; Llion Jones ; Tom Gibbs ; Tamas Feher ; Christoph Angerer ; Debsindhu Bhowmik ; Burkhard Rost
HIGHLIGHT: Methodology: Here, we trained two auto-regressive language models (Transformer-XL and XLNet) and two auto-encoder models (BERT and Albert) using 80 billion amino acids from 200 million protein sequences (UniRef100) and 393 billion amino acids from 2.1 billion protein sequences (BFD).
56, TITLE: Location-Aware Box Reasoning for Anchor-Based Single-Shot Object Detection
http://arxiv.org/abs/2007.06233
AUTHORS: Wenchi Ma ; Kaidong Li ; Guanghui Wang
HIGHLIGHT: In this paper, we aim at single-shot object detectors and propose a location-aware anchor-based reasoning (LAAR) for the bounding boxes.
57, TITLE: Active Crowd Counting with Limited Supervision
http://arxiv.org/abs/2007.06334
AUTHORS: Zhen Zhao ; Miaojing Shi ; Xiaoxiao Zhao ; Li Li
COMMENTS: ECCV2020 camera ready
HIGHLIGHT: Active Crowd Counting with Limited Supervision
58, TITLE: IntegralAction: Pose-driven Feature Integration for Robust Human Action Recognition in Videos
http://arxiv.org/abs/2007.06317
AUTHORS: Gyeongsik Moon ; Heeseung Kwon ; Kyoung Mu Lee ; Minsu Cho
HIGHLIGHT: To address this problem, we propose to learn pose-driven feature integration that dynamically combines appearance and pose streams by observing pose features on the fly.
59, TITLE: Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection
http://arxiv.org/abs/2007.06227
AUTHORS: Youwei Pang ; Lihe Zhang ; Xiaoqi Zhao ; Huchuan Lu
COMMENTS: The work has been accepted by ECCV 2020
HIGHLIGHT: In this paper, we explore these issues from a new perspective.
60, TITLE: Structured Policy Iteration for Linear Quadratic Regulator
http://arxiv.org/abs/2007.06202
AUTHORS: Youngsuk Park ; Ryan A. Rossi ; Zheng Wen ; Gang Wu ; Handong Zhao
HIGHLIGHT: In this paper, we introduce the \textit{Structured Policy Iteration} (S-PI) for LQR, a method capable of deriving a structured linear policy.
61, TITLE: Generating Functions for Probabilistic Programs
http://arxiv.org/abs/2007.06327
AUTHORS: Lutz Klinkenberg ; Kevin Batz ; Benjamin Lucien Kaminski ; Joost-Pieter Katoen ; Joshua Moerman ; Tobias Winkler
HIGHLIGHT: This paper investigates the usage of generating functions (GFs) encoding measures over the program variables for reasoning about discrete probabilistic programs.
62, TITLE: Domain aware medical image classifier interpretation by counterfactual impact analysis
http://arxiv.org/abs/2007.06312
AUTHORS: Dimitrios Lenis ; David Major ; Maria Wimmer ; Astrid Berg ; Gert Sluiter ; Katja Bühler
COMMENTS: Accepted for publication at International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2020
HIGHLIGHT: In this work, we discuss and overcome these obstacles by introducing a neural-network based attribution method, applicable to any trained predictor.
63, TITLE: Part-aware Prototype Network for Few-shot Semantic Segmentation
http://arxiv.org/abs/2007.06309
AUTHORS: Yongfei Liu ; Xiangyi Zhang ; Songyang Zhang ; Xuming He
COMMENTS: ECCV-2020
HIGHLIGHT: In this paper, we propose a novel few-shot semantic segmentation framework based on the prototype representation.
64, TITLE: DinerDash Gym: A Benchmark for Policy Learning in High-Dimensional Action Space
http://arxiv.org/abs/2007.06207
AUTHORS: Siwei Chen ; Xiao Ma ; David Hsu
HIGHLIGHT: On top of that, we introduce Decomposed Policy Graph Modelling (DPGM), an algorithm that combines both graph modelling and deep learning to allow explicit domain knowledge embedding and achieves significant improvement comparing to the baseline.
65, TITLE: Decoupling Inherent Risk and Early Cancer Signs in Image-based Breast Cancer Risk Models
http://arxiv.org/abs/2007.05791
AUTHORS: Yue Liu ; Hossein Azizpour ; Fredrik Strand ; Kevin Smith
COMMENTS: Medical Image Computing and Computer Assisted Interventions 2020
HIGHLIGHT: With this in mind, we trained networks using three different criteria to select the positive training data (i.e. images from patients that will develop cancer): an inherent risk model trained on images with no visible signs of cancer, a cancer signs model trained on images containing cancer or early signs of cancer, and a conflated model trained on all images from patients with a cancer diagnosis.
66, TITLE: Generalization of Deep Convolutional Neural Networks -- A Case-study on Open-source Chest Radiographs
http://arxiv.org/abs/2007.05786
AUTHORS: Nazanin Mashhaditafreshi ; Amara Tariq ; Judy Wawira Gichoya ; Imon Banerjee
HIGHLIGHT: We use InceptionResNetV2 and DenseNet121 architectures to predict the risk of 5 common chest pathologies.
67, TITLE: Planning on the fast lane: Learning to interact using attention mechanisms in path integral inverse reinforcement learning
http://arxiv.org/abs/2007.05798
AUTHORS: Sascha Rosbach ; Xing Li ; Simon Großjohann ; Silviu Homoceanu ; Stefan Roth
COMMENTS: Manuscript accepted at 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Received March 1, 2020. Typos corrected
HIGHLIGHT: In this work, we are concerned with the sequential reward prediction over an extended time horizon.
68, TITLE: Illuminating Mario Scenes in the Latent Space of a Generative Adversarial Network
http://arxiv.org/abs/2007.05674
AUTHORS: Matthew C. Fontaine ; Ruilin Liu ; Julian Togelius ; Amy K. Hoover ; Stefanos Nikolaidis
HIGHLIGHT: We introduce a new method called latent space illumination (LSI), which uses state-of-the-art quality diversity algorithms designed to optimize in continuous spaces, i.e., MAP-Elites with a directional variation operator and Covariance Matrix Adaptation MAP-Elites, to effectively search the parameter space of theGAN along a set of multiple level mechanics.
69, TITLE: Coarse-to-Fine Pseudo-Labeling Guided Meta-Learning for Few-Shot Classification
http://arxiv.org/abs/2007.05675
AUTHORS: Jinhai Yang ; Hua Yang ; Lin Chen
HIGHLIGHT: In this paper, we show that meta-learning models can extract transferable knowledge from coarse-grained supervision for few-shot classification.
70, TITLE: Uncertain-DeepSSM: From Images to Probabilistic Shape Models
http://arxiv.org/abs/2007.06516
AUTHORS: Jadie Adams ; Riddhish Bhalodia ; Shireen Elhabian
COMMENTS: 16 pages, 7 figures
HIGHLIGHT: Here, we propose Uncertain-DeepSSM as a unified model that quantifies both, data-dependent aleatoric uncertainty by adapting the network to predict intrinsic input variance, and model-dependent epistemic uncertainty via a Monte Carlo dropout sampling to approximate a variational distribution over the network parameters.
71, TITLE: To filter prune, or to layer prune, that is the question
http://arxiv.org/abs/2007.05667
AUTHORS: Sara Elkerdawy ; Mostafa Elhoushi ; Abhineet Singh ; Hong Zhang ; Nilanjan Ray
HIGHLIGHT: In this paper, we show the limitation of filter pruning methods in terms of latency reduction and propose LayerPrune framework.
72, TITLE: Fast Real-time Counterfactual Explanations
http://arxiv.org/abs/2007.05684
AUTHORS: Yunxia Zhao
COMMENTS: This paper has been accepted by ICML workshop 2020
HIGHLIGHT: Counterfactual explanations are considered, which is to answer {\it why the prediction is class A but not B.} Different from previous optimization based methods, an optimization-free Fast ReAl-time Counterfactual Explanation (FRACE) algorithm is proposed benefiting from the development of multi-domain image to image translation algorithms.
73, TITLE: Feature Selection on Noisy Twitter Short Text Messages for Language Identification
http://arxiv.org/abs/2007.05727
AUTHORS: Mohd Zeeshan Ansari ; Tanvir Ahmad ; Ana Fatima
HIGHLIGHT: In this article, we basically consider the Hindi-English language identification task as Hindi and English are often two most widely spoken languages of India.
74, TITLE: ECML: An Ensemble Cascade Metric Learning Mechanism towards Face Verification
http://arxiv.org/abs/2007.05720
AUTHORS: Fu Xiong ; Yang Xiao ; Zhiguo Cao ; Yancheng Wang ; Joey Tianyi Zhou ; Jianxi Wu
COMMENTS: Accepted to IEEE Transaction on Cybernetics
HIGHLIGHT: Hence, we propose a novel ensemble cascade metric learning (ECML) mechanism.
75, TITLE: Do We Need Sound for Sound Source Localization?
http://arxiv.org/abs/2007.05722
AUTHORS: Takashi Oya ; Shohei Iwase ; Ryota Natsume ; Takahiro Itazuri ; Shugo Yamaguchi ; Shigeo Morishima
COMMENTS: Paper: 14 pages, 6 figures. Supplementary Material: 6 pages, 3 figures. Videos and Codes will be released later
HIGHLIGHT: As an alternative, we present an evaluation protocol that enforces both visual and aural information to be leveraged, and verify this property through several experiments.
76, TITLE: Enhanced Behavioral Cloning Based self-driving Car Using Transfer Learning
http://arxiv.org/abs/2007.05740
AUTHORS: Uppala Sumanth ; Narinder Singh Punn ; Sanjay Kumar Sonbhadra ; Sonali Agarwal
HIGHLIGHT: Concerning the same, the present paper proposes a transfer learning approach using VGG16 architecture, which is fine tuned by retraining the last block while keeping other blocks as non-trainable.
77, TITLE: Learning Object Depth from Camera Motion and Video Object Segmentation
http://arxiv.org/abs/2007.05676
AUTHORS: Brent A. Griffin ; Jason J. Corso
HIGHLIGHT: We demonstrate our approach across domains using a robot camera to locate objects from the YCB dataset and a vehicle camera to locate obstacles while driving.
78, TITLE: Deep or Simple Models for Semantic Tagging? It Depends on your Data [Experiments]
http://arxiv.org/abs/2007.05651
AUTHORS: Jinfeng Li ; Yuliang Li ; Xiaolan Wang ; Wang-Chiew Tan
HIGHLIGHT: To answer this question, we compare deep models against "simple models" over datasets with varying characteristics.
79, TITLE: Learning Local Complex Features using Randomized Neural Networks for Texture Analysis
http://arxiv.org/abs/2007.05643
AUTHORS: Lucas C. Ribas ; Leonardo F. S. Scabini ; Jarbas Joaci de Mesquita Sá Junior ; Odemir M. Bruno
HIGHLIGHT: In this paper, we present a new approach that combines a learning technique and the Complex Network (CN) theory for texture analysis.
80, TITLE: Cascade Network with Guided Loss and Hybrid Attention for Two-view Geometry
http://arxiv.org/abs/2007.05706
AUTHORS: Zhi Chen ; Fan Yang Wenbing Tao
HIGHLIGHT: In this paper, we are committed to designing a high-performance network for two-view geometry.
81, TITLE: Deep Patch-based Human Segmentation
http://arxiv.org/abs/2007.05661
AUTHORS: Dongbo Zhang ; Zheng Fang ; Xuequan Lu ; Hong Qin ; Antonio Robles-Kelly ; Chao Zhang ; Ying He
COMMENTS: submitted for review
HIGHLIGHT: In this paper, weintroduce a deep patch-based method for 3D human segmentation.
82, TITLE: An Enhanced Text Classification to Explore Health based Indian Government Policy Tweets
http://arxiv.org/abs/2007.06511
AUTHORS: Aarzoo Dhiman ; Durga Toshniwal
COMMENTS: 4 pages, 2 figures, 2 tables
HIGHLIGHT: In this research work, we propose an improved text classification framework that classifies the Twitter data of different health-based government schemes.
83, TITLE: Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation
http://arxiv.org/abs/2007.05655
AUTHORS: Zhiwei Deng ; Karthik Narasimhan ; Olga Russakovsky
HIGHLIGHT: In this paper, we introduce the Evolving Graphical Planner (EGP), a model that performs global planning for navigation based on raw sensory input.
84, TITLE: AutoTrajectory: Label-free Trajectory Extraction and Prediction from Videos using Dynamic Points
http://arxiv.org/abs/2007.05719
AUTHORS: Yuexin Ma ; Xinge ZHU ; Xinjing Cheng ; Ruigang Yang ; Jiming Liu ; Dinesh Manocha
HIGHLIGHT: In this paper, we present a novel, label-free algorithm, AutoTrajectory, for trajectory extraction and prediction to use raw videos directly.
85, TITLE: Improved Detection of Adversarial Images Using Deep Neural Networks
http://arxiv.org/abs/2007.05573
AUTHORS: Yutong Gao ; Yi Pan
HIGHLIGHT: We propose a new approach called Feature Map Denoising to detect the adversarial inputs and show the performance of detection on the mixed dataset consisting of adversarial examples generated by different attack algorithms, which can be used to associate with any pre-trained DNNs at a low cost.
86, TITLE: Thirty-seven years of relational Hoare logic: remarks on its principles and history
http://arxiv.org/abs/2007.06421
AUTHORS: David A. Naumann
COMMENTS: Submitted to ISOLA 2020
HIGHLIGHT: Thirty-seven years of relational Hoare logic: remarks on its principles and history
87, TITLE: Conditional Lower Bound for Inclusion-Based Points-to Analysis
http://arxiv.org/abs/2007.05569
AUTHORS: Qirun Zhang
HIGHLIGHT: In this paper, we prove that a truly subcubic $O(n^{3-\delta})$ time combinatorial algorithm for inclusion-based points-to analysis is unlikely: a truly subcubic combinatorial points-to analysis algorithm implies a truly subcubic combinatorial algorithm for Boolean Matrix Multiplication (BMM).
88, TITLE: A Strong XOR Lemma for Randomized Query Complexity
http://arxiv.org/abs/2007.05580
AUTHORS: Joshua Brody ; Jae Tak Kim ; Peem Lerdputtipongporn ; Hari Srinivasulu
COMMENTS: 9 pages
HIGHLIGHT: We give a strong direct sum theorem for computing $xor \circ g$.
89, TITLE: Multitask Non-Autoregressive Model for Human Motion Prediction
http://arxiv.org/abs/2007.06426
AUTHORS: Bin Li ; Jian Tian ; Zhongfei Zhang ; Hailin Feng ; Xi Li
HIGHLIGHT: In this paper, we argue that such issue is mainly caused by adopting autoregressive manner.
90, TITLE: Vizarel: A System to Help Better Understand RL Agents
http://arxiv.org/abs/2007.05577
AUTHORS: Shuby Deshpande ; Jeff Schneider
COMMENTS: Accepted to ICML 2020 Workshop on Human Interpretability in Machine Learning (Spotlight)
HIGHLIGHT: In this work, we describe our initial attempt at constructing a prototype of these ideas, through identifying possible features that such a system should encapsulate.
91, TITLE: GGPONC: A Corpus of German Medical Text with Rich Metadata Based on Clinical Practice Guidelines
http://arxiv.org/abs/2007.06400
AUTHORS: Florian Borchert ; Christina Lohr ; Luise Modersohn ; Thomas Langer ; Markus Follmann ; Jan Philipp Sachs ; Udo Hahn ; Matthieu-P. Schapranow
HIGHLIGHT: In this work, we present GGPONC (German Guideline Program in Oncology NLP Corpus), a freely distributable German language corpus based on clinical practice guidelines in the field of oncology.
92, TITLE: Fashion-IQ 2020 Challenge 2nd Place Team's Solution
http://arxiv.org/abs/2007.06404
AUTHORS: Minchul Shin ; Yoonjae Cho ; Seongwuk Hong
COMMENTS: 4 pages, CVPR 2020 Workshop, Fashion IQ Challenge
HIGHLIGHT: Given a pair of the image and the text, we present a novel multimodal composition method, RTIC, that can effectively combine the text and the image modalities into a semantic space.
93, TITLE: Nested Learning For Multi-Granular Tasks
http://arxiv.org/abs/2007.06402
AUTHORS: Raphaël Achddou ; J. Matias di Martino ; Guillermo Sapiro
HIGHLIGHT: To address these challenges, we introduce the concept of nested learning: how to obtain a hierarchical representation of the input such that a coarse label can be extracted first, and sequentially refine this representation, if the sample permits, to obtain successively refined predictions, all of them with the corresponding confidence.
94, TITLE: Deep Cross-Subject Mapping of Neural Activity
http://arxiv.org/abs/2007.06407
AUTHORS: Marko Angjelichinoski ; Bijan Pesaran ; Vahid Tarokh
COMMENTS: Submitted to NeurIPS 2020: (15 pages, 7 figures, 5 tables)
HIGHLIGHT: In this paper, we demonstrate that a neural decoder trained on neural activity signals of one subject can be used to \textit{robustly} decode the motor intentions of a different subject with high reliability.
95, TITLE: PA-GAN: Progressive Attention Generative Adversarial Network for Facial Attribute Editing
http://arxiv.org/abs/2007.05892
AUTHORS: Zhenliang He ; Meina Kan ; Jichao Zhang ; Shiguang Shan
COMMENTS: Code: https://github.com/LynnHo/PA-GAN-Tensorflow
HIGHLIGHT: To resolve this dilemma, we propose a progressive attention GAN (PA-GAN) for facial attribute editing.
96, TITLE: HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
http://arxiv.org/abs/2007.05891
AUTHORS: Yi Tay ; Zhe Zhao ; Dara Bahri ; Donald Metzler ; Da-Cheng Juan
HIGHLIGHT: In this paper, we propose \textsc{HyperGrid}, a new approach for highly effective multi-task learning.
97, TITLE: Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
http://arxiv.org/abs/2007.05896
AUTHORS: Evan Zheran Liu ; Ramtin Keramati ; Sudarshan Seshadri ; Kelvin Guu ; Panupong Pasupat ; Emma Brunskill ; Percy Liang
HIGHLIGHT: Instead, to avoid compounding errors, we propose learning an abstract MDP over abstract states: low-dimensional coarse representations of the state (e.g., capturing agent position, ignoring other objects).
98, TITLE: Train Your Data Processor: Distribution-Aware and Error-Compensation Coordinate Decoding for Human Pose Estimation
http://arxiv.org/abs/2007.05887
AUTHORS: Feiyu Yang ; Yu Chen ; Zhe Pan ; Min Zhang ; Min Xue ; Yaoyang Mo ; Yao Zhang ; Guoxiong Guan ; Beibei Qian ; Zhenzhong Xiao ; Zhan Song
COMMENTS: Improve the state-of-the-art of COCO keypoint detection challenge by 1-2 AP. Project page: https://github.com/fyang235/DAEC
HIGHLIGHT: Serving as a model-agnostic plug-in, DAEC learns its decoding strategy from training data and remarkably improves the performance of a variety of state-of-the-art human pose estimation models.
99, TITLE: I3rab: A New Arabic Dependency Treebank Based on Arabic Grammatical Theory
http://arxiv.org/abs/2007.05772
AUTHORS: Dana Halabi ; Ebaa Fayyoumi ; Arafat Awajan
HIGHLIGHT: The purpose of this paper is to construct a new Arabic dependency treebank based on the traditional Arabic grammatical theory and the characteristics of the Arabic language, to investigate their effects on the accuracy of statistical parsers.
100, TITLE: Is Machine Learning Speaking my Language? A Critical Look at the NLP-Pipeline Across 8 Human Languages
http://arxiv.org/abs/2007.05872
AUTHORS: Esma Wali ; Yan Chen ; Christopher Mahoney ; Thomas Middleton ; Marzieh Babaeianjelodar ; Mariama Njie ; Jeanna Neefe Matthews
COMMENTS: Participatory Approaches to Machine Learning Workshop, 37th International Conference on Machine Learning
HIGHLIGHT: In this paper, a team including speakers of 8 languages - English, Chinese, Urdu, Farsi, Arabic, French, Spanish, and Wolof - takes a critical look at the typical NLP pipeline and how even when a language is technically supported, substantial caveats remain to prevent full participation.
101, TITLE: Applying recent advances in Visual Question Answering to Record Linkage
http://arxiv.org/abs/2007.05881
AUTHORS: Marko Smilevski
COMMENTS: 48 pages, 15 figures, 6 tables
HIGHLIGHT: This field has not been explored in research and we propose two solutions based on Deep Learning architectures that are inspired by recent work in Visual Question Answering.
102, TITLE: Leaky Integrate-and-Fire Spiking Neuron with Learnable Membrane Time Parameter
http://arxiv.org/abs/2007.05785
AUTHORS: Wei Fang
HIGHLIGHT: In this article, we propose a novel spiking neuron, namely parametric Leaky Integrate-and-Fire (PLIF) neuron, whose $\tau$ is a learnable parameter rather than an empirical hyper-parameter.
103, TITLE: A Hybrid Multi-Objective Carpool Route Optimization Technique using Genetic Algorithm and A* Algorithm
http://arxiv.org/abs/2007.05781
AUTHORS: Romit S Beed ; Sunita Sarkar ; Arindam Roy ; Suvranil D Biswas ; Suhana Biswas
HIGHLIGHT: This work presents a hybrid GA-A* algorithm to obtain optimal routes for the carpooling problem in the domain of multi-objective optimization having multiple conflicting objectives.
104, TITLE: Pyramid Scale Network for Crowd Counting
http://arxiv.org/abs/2007.05779
AUTHORS: Junhao Cheng ; Zhuojun Chen ; XinYu Zhang ; Yizhou Li ; Xiaoyuan Jing
HIGHLIGHT: In this paper, we propose a novel crowd counting framework called Pyramid Scale Network (PSNet) to explicitly address these issues.
105, TITLE: Driver Behavior Modelling at the Urban Intersection via Canonical Correlation Analysis
http://arxiv.org/abs/2007.05751
AUTHORS: Zirui Li ; Chao Lu ; Cheng Gong ; Cheng Gong ; Jinghang Li ; Lianzhen Wei
COMMENTS: 2020 3rd IEEE International Conference on Unmanned Systems (ICUS)
HIGHLIGHT: In this research, a canonical correlation analysis (CCA)-based framework is proposed.
106, TITLE: Neuromorphic Processing and Sensing: Evolutionary Progression of AI to Spiking
http://arxiv.org/abs/2007.05606
AUTHORS: Philippe Reiter ; Geet Rose Jose ; Spyridon Bizmpikis ; Ionela-Ancuţa Cîrjilă
COMMENTS: 15 pages, 13 figures
HIGHLIGHT: This paper explains the theoretical workings of neuromorphic technologies based on spikes, and overviews the state-of-art in hardware processors, software platforms and neuromorphic sensing devices.
107, TITLE: Class LM and word mapping for contextual biasing in End-to-End ASR
http://arxiv.org/abs/2007.05609
AUTHORS: Rongqing Huang ; Ossama Abdel-hamid ; Xinwei Li ; Gunnar Evermann
HIGHLIGHT: In this paper, we propose to train a context aware E2E model and allow the beam search to traverse into the context FST during inference.
108, TITLE: Image Captioning with Compositional Neural Module Networks
http://arxiv.org/abs/2007.05608
AUTHORS: Junjiao Tian ; Jean Oh
COMMENTS: International Joint Conference on Artificial Intelligence (IJCAI-19)
HIGHLIGHT: Inspired by the idea of the compositional neural module networks in the visual question answering task, we introduce a hierarchical framework for image captioning that explores both compositionality and sequentiality of natural language.
109, TITLE: Relation-Guided Representation Learning
http://arxiv.org/abs/2007.05742
AUTHORS: Zhao Kang ; Xiao Lu ; Jian Liang ; Kun Bai ; Zenglin Xu
COMMENTS: Appear in Neural Networks
HIGHLIGHT: In this work, we propose a new representation learning method that explicitly models and leverages sample relations, which in turn is used as supervision to guide the representation learning.
110, TITLE: Distangling Biological Noise in Cellular Images with a focus on Explainability
http://arxiv.org/abs/2007.05743
AUTHORS: Manik Sharma ; Ganapathy Krishnamurthi
COMMENTS: 13 Pages, 12 figures
HIGHLIGHT: This work aims at solving a part of this problem by creating a cellular image classification model which can decipher the genetic perturbations in cell (occurring naturally or artificially).
111, TITLE: Weighted First-Order Model Counting in the Two-Variable Fragment With Counting Quantifiers
http://arxiv.org/abs/2007.05619
AUTHORS: Ondrej Kuzelka
HIGHLIGHT: In this paper we extend this result to the two-variable fragment with counting quantifiers.
112, TITLE: GloVeInit at SemEval-2020 Task 1: Using GloVe Vector Initialization for Unsupervised Lexical Semantic Change Detection
http://arxiv.org/abs/2007.05618
AUTHORS: Vaibhav Jain
COMMENTS: To be presented at the 2020 International Workshop on Semantic Evaluation
HIGHLIGHT: This paper presents a vector initialization approach for the SemEval2020 Task 1: Unsupervised Lexical Semantic Change Detection.
113, TITLE: Multi-Dialect Arabic BERT for Country-Level Dialect Identification
http://arxiv.org/abs/2007.05612
AUTHORS: Bashar Talafha ; Mohammad Ali ; Muhy Eddin Za'ter ; Haitham Seelawi ; Ibraheem Tuffaha ; Mostafa Samir ; Wael Farhan ; Hussein T. Al-Natsheh
COMMENTS: Accepted at the Fifth Arabic Natural Language Processing Workshop (WANLP2020) co-located with the 28th International Conference on Computational Linguistics (COLING'2020), Barcelona, Spain, 12 Dec. 2020
HIGHLIGHT: In this paper, we present the experiments conducted, and the models developed by our competing team, Mawdoo3 AI, along the way to achieving our winning solution to subtask 1 of the Nuanced Arabic Dialect Identification (NADI) shared task.
114, TITLE: Deep Contextual Clinical Prediction with Reverse Distillation
http://arxiv.org/abs/2007.05611
AUTHORS: Rohan S. Kodialam ; Rebecca Boiarsky ; David Sontag
HIGHLIGHT: In this work, motivated by the task of clinical prediction from insurance claims, we present a new technique called reverse distillation which pretrains deep models by using high-performing linear models for initialization.
115, TITLE: Batch-Incremental Triplet Sampling for Training Triplet Networks Using Bayesian Updating Theorem
http://arxiv.org/abs/2007.05610
AUTHORS: Milad Sikaroudi ; Benyamin Ghojogh ; Fakhri Karray ; Mark Crowley ; H. R. Tizhoosh
COMMENTS: The first two authors contributed equally to this work
HIGHLIGHT: In this work, we sample triplets from distributions of data rather than from existing instances.
116, TITLE: Quantization in Relative Gradient Angle Domain For Building Polygon Estimation
http://arxiv.org/abs/2007.05617
AUTHORS: Yuhao Chen ; Yifan Wu ; Linlin Xu ; Alexander Wong
HIGHLIGHT: In this paper, we leverage the performance of CNNs, and propose a module that uses prior knowledge of building corners to create angular and concise building polygons from CNN segmentation outputs.
117, TITLE: PCAMs: Weakly Supervised Semantic Segmentation Using Point Supervision
http://arxiv.org/abs/2007.05615
AUTHORS: R. Austin McEver ; B. S. Manjunath
HIGHLIGHT: This paper presents a novel procedure for producing semantic segmentation from images given some point level annotations.
118, TITLE: Generative Graph Perturbations for Scene Graph Prediction
http://arxiv.org/abs/2007.05756
AUTHORS: Boris Knyazev ; Harm de Vries ; Cătălina Cangea ; Graham W. Taylor ; Aaron Courville ; Eugene Belilovsky
COMMENTS: https://oolworkshop.github.io/program/ool_21.html, ICML Workshop 2020 on "Object-Oriented Learning (OOL): Perception, Representation, and Reasoning"
HIGHLIGHT: To increase their diversity, we propose several strategies to perturb the conditioning.
119, TITLE: Probability Learning based Tabu Search for the Budgeted Maximum Coverage Problem
http://arxiv.org/abs/2007.05971
AUTHORS: Liwen Li ; Zequn Wei ; Jin-Kao Hao ; Kun He
HIGHLIGHT: In this work, we deal with the Budgeted Maximum Coverage Problem (BMCP), which is a generalized 0-1 knapsack problem.
120, TITLE: Towards practical lipreading with distilled and efficient models
http://arxiv.org/abs/2007.06504
AUTHORS: Pingchuan Ma ; Brais Martinez ; Stavros Petridis ; Maja Pantic
HIGHLIGHT: In this work, we propose a series of innovations that significantly bridge that gap: first, we raise the state-of-the-art performance by a wide margin on LRW and LRW-1000 to 88.6% and 46.6%, respectively, through careful optimization.
121, TITLE: IllumiNet: Transferring Illumination from Planar Surfaces to Virtual Objects in Augmented Reality
http://arxiv.org/abs/2007.05981
AUTHORS: Di Xu ; Zhen Li ; Yanning Zhang ; Qi Cao
HIGHLIGHT: This paper presents an illumination estimation method for virtual objects in real environment by learning.
122, TITLE: Stance Detection in Web and Social Media: A Comparative Study
http://arxiv.org/abs/2007.05976
AUTHORS: Shalmoli Ghosh ; Prajwal Singhania ; Siddharth Singh ; Koustav Rudra ; Saptarshi Ghosh
HIGHLIGHT: In this work, we explore the reproducibility of several existing stance detection models, including both neural models and classical classifier-based models.
123, TITLE: Self-Supervised Drivable Area and Road Anomaly Segmentation using RGB-D Data for Robotic Wheelchairs
http://arxiv.org/abs/2007.05950
AUTHORS: Hengli Wang ; Yuxiang Sun ; Ming Liu
COMMENTS: Published in IEEE Robotics and Automation Letters (RA-L); 8 pages, 8 figures and 3 tables
HIGHLIGHT: We contribute to the solution of this problem for the task of drivable area and road anomaly segmentation by proposing a self-supervised learning approach.
124, TITLE: Anomaly Detection-Based Unknown Face Presentation Attack Detection
http://arxiv.org/abs/2007.05856
AUTHORS: Yashasvi Baweja ; Poojan Oza ; Pramuditha Perera ; Vishal M. Patel
HIGHLIGHT: In this paper, we present a deep-learning solution for anomaly detection-based spoof attack detection where both classifier and feature representations are learned together end-to-end.
125, TITLE: Complex Wavelet SSIM based Image Data Augmentation
http://arxiv.org/abs/2007.05853
AUTHORS: Ritin Raveendran ; Aviral Singh ; Rajesh Kumar M
HIGHLIGHT: In this paper, we look particularly at the MNIST handwritten dataset an image dataset used for digit recognition, and the methods of data augmentation done on this data set.
126, TITLE: Efficient resource management in UAVs for Visual Assistance
http://arxiv.org/abs/2007.05854
AUTHORS: Bapireddy Karri ; Sudip Misra ; Senior Member IEEE
COMMENTS: 7 pages
HIGHLIGHT: This projects describes a novel method to optimize the general image processing tasks like object tracking and object detection for UAV hardware in real time scenarios without affecting the flight time and not tampering the latency and accuracy of these models.
127, TITLE: Understanding Object Detection Through An Adversarial Lens
http://arxiv.org/abs/2007.05828
AUTHORS: Ka-Ho Chow ; Ling Liu ; Mehmet Emre Gursoy ; Stacey Truex ; Wenqi Wei ; Yanzhao Wu
HIGHLIGHT: This paper presents a framework for analyzing and evaluating vulnerabilities of the state-of-the-art object detectors under an adversarial lens, aiming to analyze and demystify the attack strategies, adverse effects, and costs, as well as the cross-model and cross-resolution transferability of attacks.
128, TITLE: Representation Learning via Adversarially-Contrastive Optimal Transport
http://arxiv.org/abs/2007.05840
AUTHORS: Anoop Cherian ; Shuchin Aeron
COMMENTS: Accepted at ICML 2020
HIGHLIGHT: In this paper, we study the problem of learning compact (low-dimensional) representations for sequential data that captures its implicit spatio-temporal cues.
129, TITLE: Control as Hybrid Inference
http://arxiv.org/abs/2007.05838
AUTHORS: Alexander Tschantz ; Beren Millidge ; Anil K. Seth ; Christopher L. Buckley
HIGHLIGHT: We present an implementation of CHI which naturally mediates the balance between iterative and amortised inference.
130, TITLE: Deep Network Interpolation for Accelerated Parallel MR Image Reconstruction
http://arxiv.org/abs/2007.05993
AUTHORS: Chen Qin ; Jo Schlemper ; Kerstin Hammernik ; Jinming Duan ; Ronald M Summers ; Daniel Rueckert
COMMENTS: Presented at 2020 ISMRM Conference & Exhibition (Abstract #4958)
HIGHLIGHT: We present a deep network interpolation strategy for accelerated parallel MR image reconstruction.
131, TITLE: Differentiable Programming for Hyperspectral Unmixing using a Physics-based Dispersion Model
http://arxiv.org/abs/2007.05996
AUTHORS: John Janiczek ; Parth Thaker ; Gautam Dasarathy ; Christopher S. Edwards ; Philip Christensen ; Suren Jayasuriya
COMMENTS: 36 pages, 11 figures. Accepted to European Conference on Computer Vision (ECCV) 2020
HIGHLIGHT: In this paper, spectral variation is considered from a physics-based approach and incorporated into an end-to-end spectral unmixing algorithm via differentiable programming.
132, TITLE: Dual Adversarial Network: Toward Real-world Noise Removal and Noise Generation
http://arxiv.org/abs/2007.05946
AUTHORS: Zongsheng Yue ; Qian Zhao ; Lei Zhang ; Deyu Meng
COMMENTS: Accepted by ECCV 2020
HIGHLIGHT: In this work, we propose a novel unified framework to simultaneously deal with the noise removal and noise generation tasks.
133, TITLE: Fruit classification using deep feature maps in the presence of deceptive similar classes
http://arxiv.org/abs/2007.05942
AUTHORS: Mohit Dandekar ; Narinder Singh Punn ; Sanjay Kumar Sonbhadra ; Sonali Agarwal
HIGHLIGHT: The objective of the present research is to address the challenge of classification of deceptively similar multi-granular objects with an ensemble approach thfat utilizes activations from multiple layers of CNN (deep features).
134, TITLE: Relational-Grid-World: A Novel Relational Reasoning Environment and An Agent Model for Relational Information Extraction
http://arxiv.org/abs/2007.05961
AUTHORS: Faruk Kucuksubasi ; Elif Surer
HIGHLIGHT: In this study, we present a model-free RL architecture that is supported with explicit relational representations of the environmental objects.
135, TITLE: Abstract Universal Approximation for Neural Networks
http://arxiv.org/abs/2007.06093
AUTHORS: Zi Wang ; Aws Albarghouthi ; Somesh Jha
HIGHLIGHT: We present a theoretical result that demonstrates the power of numerical domains, namely, the simple interval domain, for analysis of neural networks.
136, TITLE: Visualizing Classification Structure in Deep Neural Networks
http://arxiv.org/abs/2007.06068
AUTHORS: Bilal Alsallakh ; Zhixin Yan ; Shabnam Ghaffarzadegan ; Zeng Dai ; Liu Ren
COMMENTS: 2020 ICML Workshop on Human Interpretability in Machine Learning (WHI 2020)
HIGHLIGHT: We propose a measure to compute class similarity in large-scale classification based on prediction scores.
137, TITLE: Adversarial Self-Supervised Learning for Semi-Supervised 3D Action Recognition
http://arxiv.org/abs/2007.05934
AUTHORS: Chenyang Si ; Xuecheng Nie ; Wei Wang ; Liang Wang ; Tieniu Tan ; Jiashi Feng
COMMENTS: Accepted by ECCV2020
HIGHLIGHT: To address these issues, we present Adversarial Self-Supervised Learning (ASSL), a novel framework that tightly couples SSL and the semi-supervised scheme via neighbor relation exploration and adversarial learning.
138, TITLE: Exploiting Uncertainties from Ensemble Learners to Improve Decision-Making in Healthcare AI
http://arxiv.org/abs/2007.06063
AUTHORS: Yingshui Tan ; Baihong Jin ; Xiangyu Yue ; Yuxin Chen ; Alberto Sangiovanni Vincentelli
COMMENTS: Preprint of submission to NeurIPS 2020
HIGHLIGHT: In this paper, we study the following key research question in the selection of uncertainty metrics: when does an uncertainty metric outperforms another?
139, TITLE: Pose-aware Adversarial Domain Adaptation for Personalized Facial Expression Recognition
http://arxiv.org/abs/2007.05932
AUTHORS: Guang Liang ; Shangfei Wang ; Can Wang
HIGHLIGHT: In this paper, we propose a novel unsupervised adversarial domain adaptation method which can alleviate both variations at the same time.
140, TITLE: Fine-grained Language Identification with Multilingual CapsNet Model
http://arxiv.org/abs/2007.06078
AUTHORS: Mudit Verma ; Arun Balaji Buduru
COMMENTS: 5 pages, 6 figures
HIGHLIGHT: Hence in this work, a real-time language detection approach to detect spoken language from 5 seconds' audio clips with an accuracy of 91.8\% is presented with exiguous data requirements and minimal pre-processing.
141, TITLE: Framework for Passenger Seat Availability Using Face Detection in Passenger Bus
http://arxiv.org/abs/2007.05906
AUTHORS: Khawar Islam ; Uzma Afzal
HIGHLIGHT: We propose a Face Detection based Framework (FDF) to determine passenger seat availability in a camera-equipped bus through face detection which is based on background subtraction to count empty, filled, and total seats.
142, TITLE: Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation
http://arxiv.org/abs/2007.06077
AUTHORS: Aditya Mogadala ; Marius Mosbach ; Dietrich Klakow
COMMENTS: International Conference on Machine Learning (ICML) 2020 Workshop (https://logicalreasoninggnn.github.io/)
HIGHLIGHT: In this paper, we mask this Vision-to-Sequence as Graph-to-Sequence learning problem and approach it with the Transformer architecture.
143, TITLE: Editable AI: Mixed Human-AI Authoring of Code Patterns
http://arxiv.org/abs/2007.05902
AUTHORS: Kartik Chugh ; Andrea Y. Solis ; Thomas D. LaToza
HIGHLIGHT: To surface these patterns to developers and support developers in authoring consistent with these patterns, we propose a mixed human-AI technique for creating code patterns.
144, TITLE: Lightweight Modules for Efficient Deep Learning based Image Restoration
http://arxiv.org/abs/2007.05835
AUTHORS: Avisek Lahiri ; Sourav Bairagya ; Sutanu Bera ; Siddhant Haldar ; Prabir Kumar Biswas
COMMENTS: Accepted at: IEEE Transactions on Circuits and Systems for Video Technology (Early Access Print) | |Codes Available at: https://github.com/avisekiit/TCSVT-LightWeight-CNNs | Supplementary Document at: https://drive.google.com/file/d/1BQhkh33Sen-d0qOrjq5h8ahw2VCUIVLg/view?usp=sharing
HIGHLIGHT: In this paper, we propose several lightweight low-level modules which can be used to create a computationally low cost variant of a given baseline model.
145, TITLE: Meta Soft Label Generation for Noisy Labels
http://arxiv.org/abs/2007.05836
AUTHORS: Görkem Algan ; Ilkay Ulusoy
HIGHLIGHT: To address this problem, we propose a Meta Soft Label Generation algorithm called MSLG, which can jointly generate soft labels using meta-learning techniques and learn DNN parameters in an end-to-end fashion.
146, TITLE: AutoEmbedder: A semi-supervised DNN embedding system for clustering
http://arxiv.org/abs/2007.05830
AUTHORS: Abu Quwsar Ohi ; M. F. Mridha ; Farisa Benta Safir ; Md. Abdul Hamid ; Muhammad Mostafa Monowar
COMMENTS: The manuscript is accepted and published in Knowledge-Based System
HIGHLIGHT: This paper introduces a novel embedding system named AutoEmbedder, that downsamples higher dimensional data to clusterable embedding points.
147, TITLE: The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results
http://arxiv.org/abs/2007.05916
AUTHORS: Xian Shi ; Qiangze Feng ; Lei Xie
HIGHLIGHT: This paper describes the design and main outcomes of the ASRU 2019 Mandarin-English code-switching speech recognition challenge, which aims to improve the ASR performance in Mandarin-English code-switching situation.
148, TITLE: Two-Stream Deep Feature Modelling for Automated Video Endoscopy Data Analysis
http://arxiv.org/abs/2007.05914
AUTHORS: Harshala Gammulle ; Simon Denman ; Sridha Sridharan ; Clinton Fookes
COMMENTS: Accepted for Publication at MICCAI 2020
HIGHLIGHT: To further the development of such methods, we propose a two-stream model for endoscopic image analysis.
149, TITLE: Understanding Adversarial Examples from the Mutual Influence of Images and Perturbations
http://arxiv.org/abs/2007.06189
AUTHORS: Chaoning Zhang ; Philipp Benz ; Tooba Imtiaz ; In-So Kweon
COMMENTS: Accepted at CVPR 2020
HIGHLIGHT: We propose to treat the DNN logits as a vector for feature representation, and exploit them to analyze the mutual influence of two independent inputs based on the Pearson correlation coefficient (PCC).
150, TITLE: Learning to Learn Parameterized Classification Networks for Scalable Input Images
http://arxiv.org/abs/2007.06181
AUTHORS: Duo Li ; Anbang Yao ; Qifeng Chen
COMMENTS: Accepted by ECCV 2020. Code and models are available at https://github.com/d-li14/SAN
HIGHLIGHT: To achieve efficient and flexible image classification at runtime, we employ meta learners to generate convolutional weights of main networks for various input scales and maintain privatized Batch Normalization layers per scale.
151, TITLE: Fusing Motion Patterns and Key Visual Information for Semantic Event Recognition in Basketball Videos
http://arxiv.org/abs/2007.06288
AUTHORS: Lifang Wu ; Zhou Yang ; Qi Wang ; Meng Jian ; Boxuan Zhao ; Junchi Yan ; Chang Wen Chen
HIGHLIGHT: Based on the observations, we propose a scheme to fuse global and local motion patterns (MPs) and key visual information (KVI) for semantic event recognition in basketball videos.
152, TITLE: Accelerated FBP for computed tomography image reconstruction
http://arxiv.org/abs/2007.06289
AUTHORS: Anastasiya Dolmatova ; Marina Chukalina ; Dmitry Nikolaev
HIGHLIGHT: In this paper, we propose a novel approach that reduces the computational complexity of the algorithm to $\Theta(N^2\log N)$ addition operations avoiding Fourier space.
153, TITLE: Data from Model: Extracting Data from Non-robust and Robust Models
http://arxiv.org/abs/2007.06196
AUTHORS: Philipp Benz ; Chaoning Zhang ; Tooba Imtiaz ; In-So Kweon
COMMENTS: Accepted at the CVPR 2020 Workshop on Adversarial Machine Learning in Computer Vision
HIGHLIGHT: We repeat the process of Data to Model (DtM) and Data from Model (DfM) in sequence and explore the loss of feature mapping information by measuring the accuracy drop on the original validation dataset.
154, TITLE: Beyond Graph Neural Networks with Lifted Relational Neural Networks
http://arxiv.org/abs/2007.06286
AUTHORS: Gustav Sourek ; Filip Zelezny ; Ondrej Kuzelka
COMMENTS: Submitted to MLJ's Special Track on Learning and Reasoning (May 15th 2020 cut-off) http://lr2020.iit.demokritos.gr/
HIGHLIGHT: We demonstrate a declarative differentiable programming framework based on the language of Lifted Relational Neural Networks, where small parameterized logic programs are used to encode relational learning scenarios.
155, TITLE: Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder
http://arxiv.org/abs/2007.06198
AUTHORS: Gouthaman KV ; Anurag Mittal
COMMENTS: ECCV 2020
HIGHLIGHT: In this work, we propose a novel model-agnostic question encoder, Visually-Grounded Question Encoder (VGQE), for VQA that reduces this effect.
156, TITLE: CheXphoto: 10,000+ Smartphone Photos and Synthetic Photographic Transformations of Chest X-rays for Benchmarking Deep Learning Robustness
http://arxiv.org/abs/2007.06199
AUTHORS: Nick A. Phillips ; Pranav Rajpurkar ; Mark Sabini ; Rayan Krishnan ; Sharon Zhou ; Anuj Pareek ; Nguyet Minh Phu ; Chris Wang ; Andrew Y. Ng ; Matthew P. Lungren
HIGHLIGHT: We introduce CheXphoto, a dataset of smartphone photos and synthetic photographic transformations of chest x-rays sampled from the CheXpert dataset. We release this dataset as a resource for testing and improving the robustness of deep learning algorithms for automated chest x-ray interpretation on smartphone photos of chest x-rays.
157, TITLE: Probabilistic bounds on data sensitivity in deep rectifier networks
http://arxiv.org/abs/2007.06192
AUTHORS: Blaine Rister ; Daniel L. Rubin
HIGHLIGHT: In this work, we provide a simple and rigorous proof of that result.
158, TITLE: PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer
http://arxiv.org/abs/2007.06191
AUTHORS: Duo Li ; Anbang Yao ; Qifeng Chen
COMMENTS: Accepted by ECCV 2020. Code and models are available at https://github.com/d-li14/PSConv
HIGHLIGHT: We bridge this regret by exploiting multi-scale features in a finer granularity.
159, TITLE: Low to High Dimensional Modality Hallucination using Aggregated Fields of View
http://arxiv.org/abs/2007.06166
AUTHORS: Kausic Gunasekar ; Qiang Qiu ; Yezhou Yang
HIGHLIGHT: We present a novel hallucination architecture that aggregates information from multiple fields of view of the local neighborhood to recover the lost information from the extant modality.
160, TITLE: Do You Have the Right Scissors? Tailoring Pre-trained Language Models via Monte-Carlo Methods
http://arxiv.org/abs/2007.06162
AUTHORS: Ning Miao ; Yuxuan Song ; Hao Zhou ; Lei Li
COMMENTS: Accepted by ACL 2020
HIGHLIGHT: In this paper, we propose MC-Tailor, a novel method to alleviate the above issue in text generation tasks by truncating and transferring the probability mass from over-estimated regions to under-estimated ones.
161, TITLE: Bridging Maximum Likelihood and Adversarial Learning via $α$-Divergence
http://arxiv.org/abs/2007.06178
AUTHORS: Miaoyun Zhao ; Yulai Cong ; Shuyang Dai ; Lawrence Carin
COMMENTS: AAAI 2020
HIGHLIGHT: We reveal that generalizations of the $\alpha$-Bridge are closely related to approaches developed recently to regularize adversarial learning, providing insights into that prior work, and further understanding of why the $\alpha$-Bridge performs well in practice.
162, TITLE: Generating Fluent Adversarial Examples for Natural Languages
http://arxiv.org/abs/2007.06174
AUTHORS: Huangzhao Zhang ; Hao Zhou ; Ning Miao ; Lei Li
COMMENTS: Accepted by ACL 2019
HIGHLIGHT: In this paper, we propose MHA, which addresses both problems by performing Metropolis-Hastings sampling, whose proposal is designed with the guidance of gradients.
163, TITLE: Coarse scale representation of spiking neural networks: backpropagation through spikes and application to neuromorphic hardware
http://arxiv.org/abs/2007.06176
AUTHORS: Angel Yanguas-Gil
COMMENTS: Paper accepted in ICONS 2020
HIGHLIGHT: In this work we explore recurrent representations of leaky integrate and fire neurons operating at a timescale equal to their absolute refractory period.
164, TITLE: Fine-Grained Crowd Counting
http://arxiv.org/abs/2007.06146
AUTHORS: Jia Wan ; Nikil Senthil Kumar ; Antoni B. Chan
HIGHLIGHT: In this paper, we propose fine-grained crowd counting, which differentiates a crowd into categories based on the low-level behavior attributes of the individuals (e.g. standing/sitting or violent behavior) and then counts the number of people in each category. To enable research in this area, we construct a new dataset of four real-world fine-grained counting tasks: traveling direction on a sidewalk, standing or sitting, waiting in line or not, and exhibiting violent behavior or not.
165, TITLE: Gender Classification and Bias Mitigation in Facial Images
http://arxiv.org/abs/2007.06141
AUTHORS: Wenying Wu ; Pavlos Protopapas ; Zheng Yang ; Panagiotis Michalatos
COMMENTS: 9 pages
HIGHLIGHT: In this paper, we began by conducting surveys on existing benchmark databases for facial recognition and gender classification tasks.
166, TITLE: Embedded Deep Bilinear Interactive Information and Selective Fusion for Multi-view Learning
http://arxiv.org/abs/2007.06143
AUTHORS: Jinglin Xu ; Wenbin Li ; Jiantao Shen ; Xinwang Liu ; Peicheng Zhou ; Xiangsen Zhang ; Xiwen Yao ; Junwei Han
HIGHLIGHT: To fulfill this goal, we propose a novel multi-view learning framework to make the multi-view classification better aimed at the above-mentioned two aspects.
167, TITLE: Temporal Self-Ensembling Teacher for Semi-Supervised Object Detection
http://arxiv.org/abs/2007.06144
AUTHORS: Cong Chen ; Shouyang Dong ; Ye Tian abd Kunlin Cao ; Li Liu ; Yuanhao Guo
COMMENTS: 11 papges, 3 figures, preprint for submission
HIGHLIGHT: To solve these problems, we propose the Temporal Self-Ensembling Teacher (TSE-T) model on top of the KD framework.
168, TITLE: Deep Reinforced Attention Learning for Quality-Aware Visual Recognition
http://arxiv.org/abs/2007.06156
AUTHORS: Duo Li ; Qifeng Chen
COMMENTS: Accepted by ECCV 2020
HIGHLIGHT: In this paper, we build upon the weakly-supervised generation mechanism of intermediate attention maps in any convolutional neural networks and disclose the effectiveness of attention modules more straightforwardly to fully exploit their potential.
169, TITLE: AI Playground: Unreal Engine-based Data Ablation Tool for Deep Learning
http://arxiv.org/abs/2007.06153
AUTHORS: Mehdi Mousavi ; Aashis Khanal ; Rolando Estrada
COMMENTS: 14 pages, 7 figures
HIGHLIGHT: In this paper, we present AI Playground (AIP), an open-source, Unreal Engine-based tool for generating and labeling virtual image data. To validate our proposed tool, we generated eight datasets of otherwise identical but varying lighting and fidelity conditions.
170, TITLE: MS-NAS: Multi-Scale Neural Architecture Search for Medical Image Segmentation
http://arxiv.org/abs/2007.06151
AUTHORS: Xingang Yan ; Weiwen Jiang ; Yiyu Shi ; Cheng Zhuo
HIGHLIGHT: This paper presents a Multi-Scale NAS (MS-NAS) framework that is featured with multi-scale search space from network backbone to cell operation, and multi-scale fusion capability to fuse features with different sizes.
171, TITLE: Universal-to-Specific Framework for Complex Action Recognition
http://arxiv.org/abs/2007.06149
AUTHORS: Peisen Zhao ; Lingxi Xie ; Ya Zhang ; Qi Tian
COMMENTS: 13 pages, 8 figures
HIGHLIGHT: Inspired by a common flowchart based on the human decision-making process that first narrows down the probable classes and then applies a "rethinking" process for finer-level recognition, we propose an effective universal-to-specific (U2S) framework for complex action recognition.
172, TITLE: EAGLE: Large-scale Dataset for Vehicle Detection in Aerial Imagery
http://arxiv.org/abs/2007.06124
AUTHORS: Seyed Majid Azimi ; Reza Bahmanyar ; Corenin Henry ; Franz Kurz
HIGHLIGHT: To address this issue, we introduce EAGLE (oriEnted object detection using Aerial imaGery in real-worLd scEnarios), a large-scale dataset for multi-class vehicle detection with object orientation information in aerial imagery.
173, TITLE: A Comparative Study on Polyp Classification using Convolutional Neural Networks
http://arxiv.org/abs/2007.06071
AUTHORS: Krushi Patel ; Kaidong Li ; Ke Tao ; Quan Wang ; Ajay Bansal ; Amit Rastogi ; Guanghui Wang
HIGHLIGHT: In this work, we compare the performance of the state-of-the-art general object classification models for polyp classification.
174, TITLE: Learning to associate detections for real-time multiple object tracking
http://arxiv.org/abs/2007.06041
AUTHORS: Michel Meneses ; Leonardo Matos ; Bruno Prado ; André de Carvalho ; Hendrik Macedo
COMMENTS: 8 pages, 6 figures
HIGHLIGHT: In this study, it is investigated the use of artificial neural networks to learning a similarity function that can be used among detections.
175, TITLE: It Is Likely That Your Loss Should be a Likelihood
http://arxiv.org/abs/2007.06059
AUTHORS: Mark Hamilton ; Evan Shelhamer ; William T. Freeman
HIGHLIGHT: We survey and systematically evaluate how to parameterize and apply likelihood parameters for robust modeling and re-calibration.
176, TITLE: The Impossibility Theorem of Machine Fairness -- A Causal Perspective
http://arxiv.org/abs/2007.06024
AUTHORS: Kailash Karthik S
HIGHLIGHT: In this report, causal perspective to the impossibility theorem of fairness is presented along with a causal goal for machine fairness.
177, TITLE: Improving Maximum Likelihood Training for Text Generation with Density Ratio Estimation
http://arxiv.org/abs/2007.06018
AUTHORS: Yuxuan Song ; Ning Miao ; Hao Zhou ; Lantao Yu ; Mingxuan Wang ; Lei Li
COMMENTS: Accepted to International Conference on Artificial Intelligence and Statistics 2020
HIGHLIGHT: In this paper, we propose{\psi}-MLE, a new training scheme for auto-regressive sequence generative models, which is effective and stable when operating at large sample space encountered in text generation.
==========Updates to Previous Papers==========
1, TITLE: REMIND Your Neural Network to Prevent Catastrophic Forgetting
http://arxiv.org/abs/1910.02509
AUTHORS: Tyler L. Hayes ; Kushal Kafle ; Robik Shrestha ; Manoj Acharya ; Christopher Kanan
COMMENTS: To appear in the European Conference on Computer Vision (ECCV-2020)
HIGHLIGHT: Here, we propose REMIND, a brain-inspired approach that enables efficient replay with compressed representations.
2, TITLE: Syn-QG: Syntactic and Shallow Semantic Rules for Question Generation
http://arxiv.org/abs/2004.08694
AUTHORS: Kaustubh D. Dhole ; Christopher D. Manning
COMMENTS: In Proceedings of the 2020 Annual Conference of the Association for Computational Linguistics (ACL 2020)
HIGHLIGHT: We implement this observation by developing Syn-QG, a set of transparent syntactic rules leveraging universal dependencies, shallow semantic parsing, lexical resources, and custom rules which transform declarative sentences into question-answer pairs.
3, TITLE: Deep Decomposition Learning for Inverse Imaging Problems
http://arxiv.org/abs/1911.11028
AUTHORS: Dongdong Chen ; Mike E. Davies
COMMENTS: To appear in ECCV 2020
HIGHLIGHT: In this paper, inspired by the geometry that data can be decomposed by two components from the null-space of the forward operator and the range space of its pseudo-inverse, we train neural networks to learn the two components and therefore learn the decomposition, \ieB we explicitly reformulate the neural network layers as learning range-nullspace decomposition functions with reference to the layer inputs, instead of learning unreferenced functions.