Hi,
I can deliver exactly what the requirement asks for. I have solid experience in CUDA development, and I can even tell you, before developing the application, why the fifth option is the fastest:
5) CUDA + No Array transpose + Tiling (the fastest)
* Tiling minimizes the costly global memory reads: each tile of A and B is loaded once into shared memory and then reused many times, and reading from shared memory is fast.
* No array transpose: for each output element we accumulate the products of a row of A with a column of B directly, without first transposing B. Although each thread walks down a column of B, adjacent threads handle adjacent columns, so the warp's combined reads touch consecutive addresses (coalesced) and take full advantage of the CUDA memory hierarchy. A short sketch of this kernel follows this list.
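To make the idea concrete, here is a minimal sketch of the kind of tiled kernel I have in mind, not the final implementation: it assumes square N x N row-major float matrices, a 16x16 tile, and illustrative names and sizes (matMulTiled, N = 512) that I chose just for this example.

#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

#define TILE 16  // 16x16 = 256 threads per block

// C = A * B for N x N row-major matrices.
// Each block computes one TILE x TILE tile of C, staging tiles of A and B
// in shared memory so every global element is read once per tile instead
// of once per multiply-add.
__global__ void matMulTiled(const float *A, const float *B, float *C, int N)
{
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;
    float sum = 0.0f;

    for (int t = 0; t < (N + TILE - 1) / TILE; ++t) {
        // Adjacent threads (threadIdx.x) load adjacent addresses of A and B,
        // so both global loads are coalesced without transposing either matrix.
        int aCol = t * TILE + threadIdx.x;
        int bRow = t * TILE + threadIdx.y;
        As[threadIdx.y][threadIdx.x] = (row < N && aCol < N) ? A[row * N + aCol] : 0.0f;
        Bs[threadIdx.y][threadIdx.x] = (bRow < N && col < N) ? B[bRow * N + col] : 0.0f;
        __syncthreads();

        // The inner products are accumulated entirely out of fast shared memory.
        for (int k = 0; k < TILE; ++k)
            sum += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();
    }

    if (row < N && col < N)
        C[row * N + col] = sum;
}

int main()
{
    const int N = 512;  // illustrative test size only
    size_t bytes = (size_t)N * N * sizeof(float);
    float *hA = (float *)malloc(bytes), *hB = (float *)malloc(bytes), *hC = (float *)malloc(bytes);
    for (int i = 0; i < N * N; ++i) { hA[i] = 1.0f; hB[i] = 2.0f; }

    float *dA, *dB, *dC;
    cudaMalloc(&dA, bytes); cudaMalloc(&dB, bytes); cudaMalloc(&dC, bytes);
    cudaMemcpy(dA, hA, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, hB, bytes, cudaMemcpyHostToDevice);

    dim3 block(TILE, TILE);
    dim3 grid((N + TILE - 1) / TILE, (N + TILE - 1) / TILE);
    matMulTiled<<<grid, block>>>(dA, dB, dC, N);
    cudaMemcpy(hC, dC, bytes, cudaMemcpyDeviceToHost);

    // Each element should be N * (1.0 * 2.0) = 2N with this fill pattern.
    printf("C[0][0] = %.1f (expected %.1f)\n", hC[0], 2.0f * N);

    cudaFree(dA); cudaFree(dB); cudaFree(dC);
    free(hA); free(hB); free(hC);
    return 0;
}

This is exactly the access pattern the profiler should confirm: coalesced global loads, high shared memory throughput, and no separate transpose kernel needed.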
As proof of concept, after developing the application we will profile it with the CUDA Visual Profiler and back these claims up with concrete measurements.
You will need an NVIDIA graphics card and the CUDA Toolkit 5.5 (not necessarily the same version, but preferable).
My [login to view URL] output:
Device 0: "GeForce GTX 660"
CUDA Driver Version / Runtime Version 5.5 / 5.5
Regards
Marouane