BARALDI, LORENZO
 Distribuzione geografica
Continente #
NA - Nord America 15.585
AS - Asia 12.028
EU - Europa 10.321
SA - Sud America 1.112
AF - Africa 197
OC - Oceania 68
Continente sconosciuto - Info sul continente non disponibili 23
Totale 39.334
Nazione #
US - Stati Uniti d'America 15.195
IT - Italia 4.583
SG - Singapore 3.374
CN - Cina 3.010
GB - Regno Unito 1.732
HK - Hong Kong 1.413
VN - Vietnam 1.063
DE - Germania 881
BR - Brasile 849
TR - Turchia 822
SE - Svezia 633
KR - Corea 506
FR - Francia 425
FI - Finlandia 361
RU - Federazione Russa 335
ID - Indonesia 315
JP - Giappone 309
IN - India 273
NL - Olanda 233
CA - Canada 218
UA - Ucraina 176
ES - Italia 164
MX - Messico 126
TW - Taiwan 118
BD - Bangladesh 116
AT - Austria 107
IE - Irlanda 104
MY - Malesia 97
BG - Bulgaria 93
TH - Thailandia 90
AR - Argentina 88
IQ - Iraq 82
PL - Polonia 72
BE - Belgio 71
CH - Svizzera 70
PH - Filippine 67
AU - Australia 59
ZA - Sudafrica 56
PK - Pakistan 51
RO - Romania 47
AE - Emirati Arabi Uniti 42
SA - Arabia Saudita 41
LT - Lituania 40
GR - Grecia 38
IL - Israele 38
CL - Cile 37
EC - Ecuador 37
DK - Danimarca 34
PT - Portogallo 30
CO - Colombia 27
KE - Kenya 27
VE - Venezuela 27
UZ - Uzbekistan 26
TN - Tunisia 23
DZ - Algeria 20
JO - Giordania 20
MA - Marocco 20
EU - Europa 19
IR - Iran 19
NP - Nepal 19
CZ - Repubblica Ceca 18
PE - Perù 18
KZ - Kazakistan 17
EG - Egitto 16
AZ - Azerbaigian 13
PY - Paraguay 13
ET - Etiopia 10
SY - Repubblica araba siriana 10
JM - Giamaica 9
OM - Oman 9
BZ - Belize 8
HU - Ungheria 8
KH - Cambogia 8
LU - Lussemburgo 8
NZ - Nuova Zelanda 8
UY - Uruguay 8
AL - Albania 7
BH - Bahrain 7
MO - Macao, regione amministrativa speciale della Cina 7
RS - Serbia 7
SC - Seychelles 7
SK - Slovacchia (Repubblica Slovacca) 7
BO - Bolivia 6
CY - Cipro 6
EE - Estonia 6
HR - Croazia 6
KG - Kirghizistan 6
MD - Moldavia 6
NO - Norvegia 6
DO - Repubblica Dominicana 5
KW - Kuwait 5
SN - Senegal 5
BB - Barbados 4
HN - Honduras 4
LB - Libano 4
LK - Sri Lanka 4
LV - Lettonia 4
PS - Palestinian Territory 4
AM - Armenia 3
BA - Bosnia-Erzegovina 3
Totale 39.278
Città #
Singapore 2.133
Ashburn 1.528
Fairfield 1.362
Santa Clara 1.360
Hong Kong 1.191
Southend 958
Modena 907
Hefei 859
San Jose 795
Chandler 774
Elâzığ 670
Woodbridge 668
Seattle 610
Houston 595
Cambridge 521
Beijing 519
Wilmington 444
Ann Arbor 384
London 349
Seoul 349
Nyköping 342
Ho Chi Minh City 339
Los Angeles 311
Bologna 289
Milan 283
Jakarta 261
Dearborn 247
Jacksonville 236
Helsinki 235
Hanoi 229
The Dalles 202
New York 196
Chicago 194
Buffalo 193
Rome 187
Council Bluffs 172
Boardman 160
Tokyo 150
Reggio Emilia 140
San Diego 135
Lauterbourg 128
Munich 127
Parma 112
Nuremberg 100
Shanghai 96
Princeton 89
Sofia 87
Amsterdam 84
Bangkok 82
São Paulo 81
Frankfurt am Main 79
Redwood City 79
Orem 78
Dublin 75
Izmir 73
Kent 73
Dong Ket 69
Dallas 67
Montreal 67
Phoenix 65
Moscow 63
Florence 61
Salt Lake City 60
Eugene 59
Mexico City 59
Chennai 55
Pisa 55
Taipei 54
Naples 53
Bomporto 50
Bremen 49
Vienna 48
Da Nang 47
Haiphong 47
Manchester 47
Paris 47
Kuala Selangor 46
Formigine 45
Warsaw 45
Falkenstein 44
Toronto 44
Zurich 44
Manila 42
Brussels 41
Piacenza 39
Atlanta 37
Fremont 36
Lappeenranta 35
Ottawa 34
Guangzhou 33
Johannesburg 33
Turin 33
Falls Church 32
Trento 32
Wilmette 32
Tampa 31
Düsseldorf 30
Nanjing 29
Rio de Janeiro 29
San Francisco 29
Totale 24.917
Nome #
What was Monet seeing while painting? Translating artworks to photo-realistic images 647
Spaghetti Labeling: Directed Acyclic Graphs for Block-Based Connected Components Labeling 636
Connected Components Labeling on DRAGs 593
MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models 573
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 565
Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach 546
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models 506
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 498
Towards Cycle-Consistent Models for Text and Image Retrieval 492
Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain 473
Modeling Multimodal Cues in a Deep Learning-based Framework for Emotion Recognition in the Wild 465
Automatic Image Cropping and Selection using Saliency: an Application to Historical Manuscripts 442
YACCLAB - Yet Another Connected Components Labeling Benchmark 431
Connected Components Labeling on DRAGs: Implementation and Reproducibility Notes 430
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions 419
Layout analysis and content classification in digitized books 411
Learning to Read L'Infinito: Handwritten Text Recognition with Synthetic Training Data 405
Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities 404
M-VAD Names: a Dataset for Video Captioning with Naming 400
A Deep Multi-Level Network for Saliency Prediction 397
Explaining Digital Humanities by Aligning Images and Textual Descriptions 396
Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions 391
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model 387
A Hierarchical Quasi-Recurrent approach to Video Captioning 386
A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation 382
Recognizing social relationships from an egocentric vision perspective 379
A Video Library System Using Scene Detection and Automatic Tagging 374
A Deep Siamese Network for Scene Detection in Broadcast Videos 371
Optimized Connected Components Labeling with Pixel Prediction 371
Historical Document Digitization through Layout Analysis and Deep Content Classification 366
Analysis and Re-use of Videos in Educational Digital Libraries with Automatic Scene Detection 364
Shot and Scene Detection via Hierarchical Clustering for Re-using Broadcast Video 360
Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager 359
Hierarchical Boundary-Aware Neural Encoder for Video Captioning 359
Image-to-Image Translation to Unfold the Reality of Artworks: an Empirical Analysis 359
Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis 358
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation 357
Hand Segmentation for Gesture Recognition in EGO-Vision 354
Dual-Branch Collaborative Transformer for Virtual Try-On 352
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation 351
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs 347
Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks 346
The Revolution of Multimodal Large Language Models: A Survey 341
Measuring scene detection performance 340
SAM: Pushing the Limits of Saliency Prediction Models 339
Gesture Recognition using Wearable Vision Sensors to Enhance Visitors' Museum Experiences 333
Visual Saliency for Image Captioning in New Multimedia Services 333
Ai4ar: An ai-based mobile application for the automatic generation of ar contents 333
SynthCap: Augmenting Transformers with Synthetic Data for Image Captioning 333
Multi-Level Net: a Visual Saliency Prediction Model 331
LAMV: Learning to align and match videos with kernelized temporal layers 327
Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation 326
Explore and Explain: Self-supervised Navigation and Recounting 315
From Show to Tell: A Survey on Deep Learning-based Image Captioning 315
A Novel Attention-based Aggregation Function to Combine Vision and Language 314
Tracing Information Flow in LLaMA Vision: A Step Toward Multimodal Understanding 308
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention 304
Multimodal Attention Networks for Low-Level Vision-and-Language Navigation 303
Scene segmentation using temporal clustering for accessing and re-using broadcast video 299
Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features 299
Towards Video Captioning with Naming: a Novel Dataset and a Multi-Modal Approach 297
Meshed-Memory Transformer for Image Captioning 292
Towards Reliable Experiments on the Performance of Connected Components Labeling Algorithms 290
CaMEL: Mean Teacher Learning for Image Captioning 284
Embodied Agents for Efficient Exploration and Smart Scene Description 281
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities 272
A Computational Approach for Progressive Architecture Shrinkage in Action Recognition 271
A Unified Cycle-Consistent Neural Model for Text and Image Retrieval 265
A Deep-learning-based approach to VM behavior Identification in Cloud Systems 263
Retrieval-Augmented Transformer for Image Captioning 261
Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions 259
Video action detection by learning graph-based spatio-temporal interactions 258
Investigating Bidimensional Downsampling in Vision Transformer Models 257
NeuralStory: an Interactive Multimedia System for Video Indexing and Re-use 256
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval 254
Adapt to Scarcity: Few-Shot Deepfake Detection via Low-Rank Adaptation 246
Focus on Impact: Indoor Exploration with Intrinsic Motivation 239
Learning to Select: A Fully Attentive Approach for Novel Object Captioning 238
Intelligent Multimodal Artificial Agents that Talk and Express Emotions 237
Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters 237
Semantically Conditioned Prompts for Visual Recognition under Missing Modality Scenarios 236
Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis 233
Embodied Navigation at the Art Gallery 232
Hyperbolic Safety-Aware Vision-Language Models 232
Assessing the Role of Boundary-level Objectives in Indoor Semantic Segmentation 231
The Unreasonable Effectiveness of CLIP features for Image Captioning: an Experimental Analysis 231
RMS-Net: Regression and Masking for Soccer Event Spotting 229
Improving Indoor Semantic Segmentation with Boundary-level Objectives 229
Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization 225
BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues 222
Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach 218
ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval 217
Estimating (and fixing) the Effect of Face Obfuscation in Video Recognition 215
Multimodal Emotion Recognition in Conversation via Possible Speaker's Audio and Visual Sequence Selection 211
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning 211
The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition 211
Towards Explainable Navigation and Recounting 210
FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval 210
Working Memory Connections for LSTM 208
Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates 202
Totale 33.265
Categoria #
all - tutte 131.089
article - articoli 0
book - libri 0
conference - conferenze 0
curatela - curatele 0
other - altro 0
patent - brevetti 0
selected - selezionate 0
volume - volumi 0
Totale 131.089


Totale Lug Ago Sett Ott Nov Dic Gen Feb Mar Apr Mag Giu
2020/2021521 0 0 0 0 0 0 0 0 0 0 279 242
2021/20223.237 183 139 240 169 84 214 186 230 311 335 825 321
2022/20232.767 364 299 246 227 301 283 92 226 386 75 144 124
2023/20242.525 243 170 246 312 439 164 120 183 70 201 131 246
2024/20258.439 710 242 260 496 1.229 925 512 650 1.081 567 803 964
2025/202614.228 1.084 780 1.263 1.528 2.134 1.059 1.930 1.297 1.295 1.683 175 0
Totale 39.958