'AI' 태그의 글 목록

Recent Posts

Recent Comments

« 2026/04 »
일	월	화	수	목	금	토
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30

Today

Total

관리 메뉴

목록AI (6)

꿈꾸는 개발자의 devLog

[Deep Learning 논문 리뷰] Image2StyleGAN : How to Embed Images Into the StyleGAN Latent Space?

"모두의 연구소"의 Advanced Computer Vision 스터디를 통해 공부한 논문 정리입니다.해당 연구는 2019년 ICCV에 게재된 논문임https://arxiv.org/abs/1904.03189 Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?We propose an efficient algorithm to embed a given image into the latent space of StyleGAN. This embedding enables semantic image editing operations that can be applied to existing photographs. Taking the StyleGAN t..

Study/Papers 2024. 5. 28. 01:59

[Deep Learning 논문 리뷰] PWC-Net : CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume

"모두의 연구소"의 Advanced Computer Vision 스터디를 통해 공부한 논문 정리입니다.해당 연구는 2018년 CVPR 에 게재된 논문임 https://arxiv.org/abs/1709.02371 PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost VolumeWe present a compact but effective CNN model for optical flow, called PWC-Net. PWC-Net has been designed according to simple and well-established principles: pyramidal processing, warping, and the use of a cost..

Study/Papers 2024. 5. 20. 01:59

[Deep Learning 논문 리뷰] Mesh R-CNN

'모두의 연구소'라는 플랫폼을 통해서 'Advanced Computer Vision Study'에 스터디원으로 참석중이다.최근에 내가 발표한 논문들이나, 해당 스터디를 통해서 습득한 지식을 블로그에 작성할 것이다..사실 다른 것에 집중하느라 글 쓰는 것에 집중하지 못하였지만, 최대한 시간 되는대로 작성을 해야겠다 https://arxiv.org/abs/1906.02739 Mesh R-CNNRapid advances in 2D perception have led to systems that accurately detect objects in real-world images. However, these systems make predictions in 2D, ignoring the 3D structure ..

Study/Papers 2024. 5. 13. 14:40

[Deep Learning 논문 리뷰] Diffusion Transformer(DiT) : Scalable Diffusion Models with Transformers

ViT 다음으로 선택한 DiT ! 사실 이미지 분류에서는 요즘에는 ViT를 주로 사용하고 있어서, 거의 반복되는 구조인 것같아서 다른 도메인에서의 최신 연구를 확인해보았다. 요즘 생성 AI 때문에 이미지 생성 분야가 뜨고 있어서 Diffusion의 최신 논문은 무엇일까 하는 궁금증에 검색해 보았더니, 역시나 여기도 Transformer!! Diffusion 영역에서는 어떻게 사용되는지 궁금해서 이 논문을 선택했다. 근데, 사실 Diffusion 도메인은 처음 공부하는 것이다. .. 그래서 다른 블로그 글을 많이 참고했다 ㅠㅠ 참고한 기술블로그는 글 아래 하단에 첨부되어 있다. (감사합니다 다들 :) ) 세상에는 똑똑한 사람이 정말 많다,, https://arxiv.org/abs/2212.09748 Sca..

Study/Papers 2024. 4. 3. 02:14

[Deep Learning 논문 리뷰] Vision Transformer(ViT) : An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

팀원 끼리 자체적으로 딥러닝 논문 스터디를 진행 중이다 그래서, 약 3주에 한 번씩 각자 관심 분야 논문에 대해 발표를 진행함 이 글이 여기 블로그의 100번째 글인데, 100번째 글이 첫 논문 리뷰 글이라니 !! 뭔가 좋다 :) 이 기세로 쭉쭉 가보주악 !!!! https://arxiv.org/abs/2010.11929 An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision..

Study/Papers 2024. 1. 19. 17:30

[Pytorch] CIFAR10 이미지 학습 예제

Ref : https://learn.microsoft.com/ko-kr/windows/ai/windows-ml/tutorials/pytorch-data 분류에 대한 PyTorch ML 모델 준비 분류에 대한 Pytorch ML 학습 모델 준비 learn.microsoft.com 위의 링크에서 제시하는 예제를 테스트 하였음 해당 예제를 테스트한 서버의 스펙과 파이토치 버전은 아래와 같음 서버 : NVIDIA GeForce RTX 4090 torch : 1.13.1 torchvision : 0.14.1 python : 3.10 anaconda 환경의 주피터 노트북에서 실행하였음 [Dataset 준비] from torchvision.datasets import CIFAR10 from torchvision.tr..

Computer Science 2023. 11. 29. 15:35

Prev 1 Next

목록AI (6)

꿈꾸는 개발자의 devLog

티스토리툴바